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PREFACE 


A DEAFT of these lectures, written before they were delivered, 
contained considerably more technicalities and mathematics than 
the present text. Facing a large audience in which physicists 
and mathematicians were presumably a minority, I had to 
change my plans and to improvise a simplified presentation. 
Though this did not seem difficult on the platform of the Hall 
of Magdalen College, Oxford, the final formulation for publi¬ 
cation was not an easy task. I did not like replacing rigorous 
mathematical reasoning by that mixture of literary style, 
authority, and mystery which is often used by popularizing and 
philosophizing scientists. Thus, the idea occurred to me to 
preserve the mathematics by removing it to a detailed appendix 
which could also contain references to the hterature. The vast 
extension of the latter, however, compelled me to restrict quota¬ 
tions to recent publications which are not in the text-books. 
Some of these supplements contain unpublished investigations 
of my school, mainly by my collaborator Dr. H. S. Green. In the 
text itself I have given up the original division into seven lec¬ 
tures and replaced it by a more natural arrangement into ten 
chapters. 

I have to thank Dr. Green for his untiring help in reading, 
criticizing, and correcting my script, working out drafts of the 
appendix, and reading proofs. I am also indebted to Mr. Lewis 
Elton not only for proof-reading but for carefully preparing the 
index. I have further to thank Albert Einstein for permission 
to publish sections of two of his letters. 

My most sincere gratitude is due to the President and the 
Fellows of Magdalen College who gave me the opportunity to 
plan these lectures, and the leisure to write them down for 
publication. 

I wish to thank the Oxford University Press for the excellent 
printing and their willingness to follow all my wishes. 


M. B. 
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INTRODUCTION 

The notions of cause and chance which I propose to deal with 
in these lectures are not specifically physical concepts but have 
a much wider meaning and application. They are used, more or 
less vaguely, in everyday life. They appear, not only in all 
branches of science, but also in history, psychology, philosophy, 
and theology; everywhere with a different shade of meaning. 
It would be far beyond my abilities to give an account of all 
these usages, or to attempt an analysis of the exact significance 
of the words cause ’ and ‘ chance ’ in each of them. However, 
it is obvious that there must be a common feature in the use of 
these notions, like the theme in a set of variations. Indeed, 
cause expresses the idea of necessity in the relation of events, 
while chance means just the opposite, complete randomness. 
Nature, as well as human affairs, seems to be subject to both 
necessity and accident. Yet even accident is not completely 
arbitrary, for there are laws of chance, formulated in the mathe¬ 
matical theory of probability, nor can the cause-effect relation 
be used for predicting the future with certainty, as this would 
require a complete knowledge of the relevant circumstances, 
present, past, or both together, which is not available. There 
seems to be a hopeless tangle of ideas. In fact, if you look 
through the literature on this problem you will find no satisfac¬ 
tory solution, no general agreement. Only in physics has a 
systematic attempt been made to use the notions of cause and 
chance in a way free from contradictions. Physicists form their 
notions through the interpretation of experiments. This method 
may rightly be called Natural Philosophy, a word still used for 
physics at the Scottish universities. In this sense I shall attempt 
to investigate the concepts of cause and chance in these lectures. 
My material will be taken mainly from physics, but I shall try 
to regard it with the attitude of the philosopher, and I hope that 
the results obtained will be of use wherever the concepts of 
cause and chance are applied. I know that such an attempt will 
not find favour with some philosophers, who maintain that 

5131 -o 
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science teaches only a narrow aspect of the world., and one wliicli 
is of no great importance to man’s mind. It is true that many 
scientists are not philosophically minded and have hitherto 
shown much skill and ingenuity hut little wisdom. I need hardly 
to enlarge on this subject. The practical applications of science 
have given us the means of a fuller and richer life, but also the 
means of destruction and devastation on a vast scale. Wise 
men would have considered the consequences of their activities 
before starting on them; scientists have failed to do so, and only 
recently have they become conscious of their responsibilities to 
society. They have gained prestige as men of action, but they 
have lost credit as philosophers. Yet history show's that science 
has played a leading part in the development of human thought. 


It has not only supplied raw material to philosophy by gathering 
facts, but also evolved the fundamental concepts on how to deal 
with them. It suffices to mention the Copernican system of the 
universe, and the Newtonian dynamics which sprang from it. 
These originated the conceptions of space, time, matter, force, 
and motion for a long time to come, and had a mighty influence 
on many philosophical systems. It has been said that the meta¬ 
physics of any period is the offspring of the physics of the pre¬ 
ceding period. If this is trne, it puts us physicists under the 
obhgation to explain onr ideas in a not-too-technical language. 
This IS the purpose of the following lectures on a restricted 
though important field. I have made an attempt to avoid 
mathematics entirely, but failed. It would have meant an un¬ 
bearable clumsiness of expression and loss of clarity. A way out 
would have been the reduction of all higher mathematics to 
elementary methods in EucHdean style—foUowiug the cele- 
rated example of Newton’s Principia, But this would even 
have increased the clumsiness and destroyed what there is of 
aestheti^ppeal. I personaUy think that more than 200 years 
Me^on^ere shouldbe some progress in the assimilatL of 
mathematics by those who are interested in natural philosophy. 
So I shaU use ordmary language and formulae in a suitaWe 
mixture, hut I shall not give proofs of fKoo.... /.i. 

coUeeted in the Appendix). theorems (they are 
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In this way I hope to explain how physics may throw some 
light on a problem which is not only important for abstract 
knowledge but also for the behaviour of man. An unrestricted 
belief in causality leads necessarily to the idea that the world is 
an automaton of which we ourselves are only little cog-wheels. 
This means materialistic determinism. It resembles very much 
that religious determinism accepted by different creeds, where 
the actions of men are believed to be determined from the 
beginning by a ruling of God. I cannot enlarge on the difficulties 
to which this idea leads if considered from the standpoint of 
ethical responsibility. The notion of divine predestination 
clashes with the notion of free will, in the same way as the 
assumption of an endless chain of natural causes. On the other 
hand, an unrestricted belief in chance is impossible, as it cannot 
be denied that there are a gi*eat many regularities in the world; 
hence there can be, at most, ‘regulated accident’. One has to 
postulate laws of chance which assume the appearance of laws 
of nature or laws for human behaviour. Such a philosophy 
would give ample space for free will, or even for the willed 
actions of gods and demons. In fact, all primitive polytheistic 
religions seem to be based on such a conception of nature: things 
happening in a haphazard way, except where some spirit inter¬ 
feres with a purpose. We reject to-day this demonological 
philosophy, but admit chance into the realm of exact science. 
Our philosophy is dualistic in this respect; nature is ruled by 
laws of cause and laws of chance in a certain mixture. How is 
this possible ? Are there no logical contradictions ? Can this 
mixture of ideas be cast into a consistent system in which all 
phenomena can be adequately described or explained ? What 
do we mean by such an explanation if the feature of chance is 
involved ? What are the irreducible or metaphysical principles 
involved ? Is there any room in this system for free will or for 
the interference of deity ? These and many other questions can 
be asked. I shall try to answer some of them from the stand¬ 
point of the physicist, others from my philosophical convictions 
which are not much more than common sense improved by 
sporadic reading. The statement, frequently made, that modern 
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physics has given up causality is entirely unfounded. Modern 
physics, it is trae, has given up or modified many traditional 
ideas; hut it would cease to he a science if it had given up the 
search for the causes of phenomena. I found it necessary, 
therefore, to formulate the different aspects of the fundamental 
notions by giving definitions of terms which seem to me in agree¬ 
ment with ordinary language. With the help of these concepts, 
I shall survey the development of physical thought, dwelling 
here and there on special points of interest, and I shall try to 
apply the results to philosophy in general. 



II 

CAUSALITY AND DETERMINISM 


The concept of causality is closely linked with that of determin¬ 
ism, yet they seem to me not identical. Moreover, causality is 
used with several different shades of meaning. I shall try to dis¬ 
entangle these notions and eventually sum them up in definitions. 

The cause-effect relation is used mainly in two ways; I shall 
illustrate this by giving examples, partly from ordinary life, 
partly from science. Take these statements: 

‘Overpopulation is the cause of India’s poverty.’ 

‘The stability of British politics is caused by the institution 
of monarchy.’ 

‘Wars are caused by the economic conditions.’ 

‘There is no life on the moon because of the lack of an 
atmosphere containing oxygen.’ 

‘Chemical reactions are caused by the affinity of molecules.’ 

The common feature to which I wish to draw your attention 
is the fa-ct that these sentences state timeless lelations. They 
say that one thing or one situation A causes another B, meaning 
apparently that the existence of B depends on A, or that if A 
were changed or absent, B would also be changed or absent. 
Compare these statements with the following : 

‘The Indian famine of 1946 was caused by a bad harvest.’ 

‘The fall of Hitler was caused by the defeat of his armies.’ 

‘The American war of secession was caused by the economic 
situation of the slave states.’ 

‘Life could develop on earth because of the formation of an 
atmosphere containing oxygen.’ 

‘The destruction of Hiroshima was caused by the ex])losion 
of an atomic bomb.’ 

In these sentences one definite event A is regai'ded m the 
cause of another B; both events are more or less fixed in space 
and time. I think that these two different shades of the cause- 
effect relation are both perfectly legitimate. The common factor 
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is tlie idea of dependence, which, needs some comment. This 
concept of dependence is clear enough if the two things con¬ 
nected are concepts themselves, things of the mind, like two 
numbers or two sets of numbers; then dependence means what 
the mathematician expresses by the word ‘function’. This 
logical dependence needs no further analysis (I even think it 
caimot be further analysed). But causality does not refer to 
logical dependence; it means dependence of real things of nature 
on one another. The problem of what this means is not simple 
at all. Astrologers claim the dependence of the fate of human 
beings on the constellations of stars. Scientists reject such state¬ 
ments—^but why? Because science accepts only relations of 
dependence if they can be verified by observation and experi¬ 
ment, and we are convinced that astrology has not stood this 
test. Science insists on a criterion for dependence, namely 
repetitive observation or experiment: either the things A and JB 
refer to phenomena, occurring repeatedly in Nature and being 
sufficiently similar for the aspect in question to be considered 
as identical; or repetition can be artificially produced by 
experiment. 

Observation and experiment are crafts which are systemati¬ 
cally taught. Sometimes, by a genius, they are raised to the 
level of an art. There are rules to be observed: isolation of the 
system considered, restriction of the variable factors, varying 
of the conditions until the dependence of the effect on a single 
factor becomes evident; in many cases exact measurements and 
comparison of figures are essential. The technique of handling 
these figures is a craft in itself, in which the notions of chance 
and probability play a decisive part—we shall return to this 
question at a later stage. So it looks as if science has a methodical 
way of finding causal relations without referring to any meta¬ 
physical principle. But this is a deception. For no observation 
or experiment, however extended, can give more than a finite 
number of repetitions, and the statement of a law —B depends on 
A —always transcends experience. Yet this kind of state¬ 
ment is made everywhere and all the time, and sometimes from 
scanty material. Philosophers call it Inference by Induction, 
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nd have developed many a profound theory of it. I shall not 
nter into a discussion of these speculations. But I have to 
lake it clear why I distinguish this principle of induction from 
ausality. Induction allows oneto generalize a number of observa- 
ions into a general rule: that night follows day and day follows 
ight, or that in spring the trees grow green leaves, are induc- 
Lons, but they contain no causal relation, no statement of 
ependence. The method of inductive thinking is more general 
lan causal thinking; it is used in everyday life as a matter of 
Durse, and it applies in science to the descriptive and experi- 
lental branches as well. But while everyday life has no definite 
:iterion for the validity of an induction and relies more or less 
a intuition, science has worked out a code, or rule of craft, for 
s application. This code has been entirely successful, and I 
link that is the only justification for it—just as the rules of the 
'aft of classical music are only justified by full audiences and 
pplause. Science and art are not so different as they appear, 
he laws in the realms of truth and beauty are laid down by the 
tasters, who create eternal works. 

Absolute values are ideals never reached. Yet I think that 
le common effort of mankind has approached some ideals in 
aite a respectable way. I do not hesitate to call a man foolish 
he rejects the teaching of experience because no logical proof 
forthcoming, or because he does not know or does not accept 
le rules of the scientific craft. You find such suiier-logical 
iople sporadically among pure mathematicians, theologians, 
id philosophers, while there are besides vast communities of 
jople ignorant of or rejecting the rules of science, among them 
Le members of anti-vaccination societies and believers in 
itrology. It is useless to argue with them; I cannot compel 
tern to accept the same criteria of valid induction in which I 
>lieve: the code of scientific rules. For there is no logical 
gument for doing so; it is a question of faith. In this sense I 
n willing to call induction a metaphysical principle, namely 
mething beyond physics. 

After this excursion, let us return to causality and its two 
lys of apx)lication, one as a timeless relation of dependence, 
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the other as a dependence of one event fixed in time and space 
on another (see Appendix, 1 ). I think that the abstract, timeless 
meaning of causality is the fundamental one. This becomes 
quite evident if one tries to use the term in connexion with a 
specific case without implicit reference to the abstraction. For 
example: The statement that a bad harvest was the cause of the 
Indian famine makes sense only if one has in mind the timeless 
statement that bad harvests are causes of famines in general. 
I leave it to you to confirm this with the other examples I have 
given or with any more you may invent. If you drop this refer¬ 
ence to a general rule, the connexion between two consecutive 
events loses its character of causality, though it may still retain 
the feature of perfect regularity, as in the sequence of day and 
night. Another example is the time-table of a railway line. You 
can predict with its help the arrival at King’s Cross of the 
10 o’clock from Waverley; but you can hardly say that the time¬ 
table reveals a cause for this event. In other words, the law 
of the time-table is deterministic: You can predict future events 
from it, but the question ‘why ? ’ makes no sense. 

Therefore, I think one should not identify causality and 
determinism. The latter refers to rules which allow one to 
predict fi?om the knowledge of an event A the occurrence of an 
event JB (and vice versa), but without the idea that there is a 
physical timeless (and spaceless) link between all things of the 
kind A and ah things of the kind B. I prefer to use the ex¬ 
pression ‘causality’ mainly for this timeless dependence- It is 
exactly what experimentalists and observers mean when they 
trace a certain phenomenon to a certain cause by systematic 
variation of conditions. The other application of the word to 
two events following one another is, however, in so common use 
that it cannot be excluded. Therefore I suggest that it should be 
used also, but supplemented by some ‘attributes’ concerning 
time and space. It is always assumed that the cause precedes the 
effect; I propose to cah this the principle of antecedence. Further, 
it is generally regarded as repugnant to assume a thing to cause an 
effect at a place where it is not present, or to which it cannot be 
linked by other things ; I shall call this the principle of contiguity. 
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I shall now try to condense these considerations in a few 
definitions. 

Deteminim postulates that events at different times are 
connected by laws in such a way that predictions of un¬ 
known situations (past or future) can be made. 

By this formulation religious predestination is excluded, since 
it assumes that the book of destiny is only open to God. 

Causality postulates that there are laws by which the occur¬ 
rence of an entity B of a certain class depends on the 
occurrence of an entity A of another class, where the word 
‘entity’ means any physical object, phenomenon, situation, 
or event. A is called the cause, B the effect. 

If causahty refers to single events, the following attributes 
of causality must be considered: 

Antecedence postulates that the cause must be prior to, or at 
least simultaneous with, the effect. 

Contiguity postulates that cause and effect must be in spatial 
contact or comiected by a chain of intermediate things in 
contact. 
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ILLUSTRATIOlSr: ASTRONOMY AND PARTICLE 

MECHANICS 

I SHALL now iUnstrate these definitions by surveying the 
development of physical science. But do not expect an ordinary 
historical treatment. I shall not describe how a great man 
actually made his discoveries, nor do I much care what he him¬ 
self said about it. I shall try to analyse the scientific situation 
at the time of the discovery, judged by a modern mind, and 
describe them in terms of the definitions given. 

Let us begin with the oldest science, astronomy. Pre- 
Newtonian theory of celestial motions is an excellent example 
of a mathematical and deterministic, yet not causal, description. 
This holds for the Ptolemaic system as well as for the Copernican, 
including Kepler’s refinements. Ptolemy represented the motion 
of the planets by kinematic models, cycles, and epicycles rolling 
on one another and on the fixed heavenly sphere. Copernicus 
changed the standpoint and made the sun the centre of cyclic 
planetary motion, while Kepler replaced the cycles by ellipses. 

I do not wish to minimize the greatness of Copernicus’ step in 
regard to the conception of the Universe. I just consider it from 
the standpoint of the question which we are discussing. Neither 
Ptolemy nor Copernicus nor Kepler states a cause for the be¬ 
haviour of the planets, except the ultimate cause, the will of the 
Creator. What they do is, in modern mathematical language, 
the establishment of functions, 

for the coordinates of aU particles, depending on time. Coperni¬ 
cus himself claimed rightly that his functions, or more accurately 
the corresponding geometrical structures, are very much simpler 
than those of Ptolemy, but he refrained from advocating the 
cosmological consequences of his system. This question came 
to the foreground long after his death mainly by Galileo’s tele¬ 
scopic observations, which revealed in Jupiter and his satellites 
a repetition of the Copernican system on a smaller scale. 
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soartes’s cosmology can be regarded as an early attempt to 
lish causal laws for the planetary orbits by assuming a 
Jicated vortex motion of some kind of ether, and it is 
rkable that this construction satisfies contiguity. But it 
. because it lacked the main feature of scientific progress; 

3 not based on a reasonable induction from facts. Of course, 
de of rules existed, nor did Descartes’s writings provide it 
at time. The principles of the code accepted to-day are 
citly contained in the works of Galileo and ISfewton, who 
nstrated them with their actual discoveries in physics and 
aomy—in the same way as Haydn established the rules of 
mat a by writing lovely music in this form, 
lileo’s work precedes JNTewton’s not only in time but also 
jical order; for Galileo was experimenting with teri'estrial 
ts according to the rules of repetition and variation of con- 
IS, while ISTewton’s astronomical material was purely obser- 
aal and restricted. Galileo observed how a falling body 
s, and studied the conditions on which the motion depends. 
3sults can be condensed into the well-known formula for the 
;al coordinate of a small body or ‘particle’ as a function of 


^ (3.1) 

5 (7 is a constant, i.e. independent not only of time but also 
5 falling body. The only thing this quantity g can depend 
is the body towards which the motion is taking place, the 
—a conclusion which is almost too obvious to be formu- 
; for if the motion is checked by my hand, I feel the weight 
>ressnre directed downwards towards the earth. Hence the 
ant g must be interpreted as a property of the earth, not of 
tilling body. 

ng Newton’s calculus (denoting the time derivative by a 
md generalizing for all three coordinates, one obtains the 


ions 


= 0, ^ = 0, z = —g. 


i describe the trajectories of particles upon the earth with 
•ary initial j)ositions and velocities. 

3se formulae condense the description of an infinite number 
)its and motions in one single simple statement: that some 
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property of tlie motion is the same for the whole class, indepen¬ 
dent of the individual case, therefore depending only on the one 
other thing involved, namely the earth. Hence this property, 
namely the vertical acceleration, must be ‘due to the earth’, or 
‘caused by the earth’, or ‘a force exerted by the earth’. 

This word ‘force’ indicates a specification of the general 
notion of cause, namely a measurable cause, expressible in 
figures. Apart from this refinement, Galileo’s work is just a case 
of ordinary causality in the sense of my definition. 

Yet the law (3.2) involves time, since the ‘effect’ of the force 
is an acceleration, the rate of change of velocity in time. This 
is the actual result of observation and measurement, and has no 
metaphysical root whatsoever. A consequence of this fact is the 
deterministic character of the law (3.2); if the position and the 
velocity of a particle are given at any time, the equations deter¬ 
mine its position and velocity at any other time. 

In fact, any other time in the past or future. This shows that 
Galileo’s law does not conform to the postulate of antecedence: 
a given initial situation cannot be regarded as the cause of a 
later situation, because the relation between them is completely 
sjmimetrical; each determines the other. This is closely con¬ 
nected with the notion of time which Galileo used, and which 
ISTewton took care to define expficitly. 

The postulate of contiguity is also violated by Galileo’s law 
since the action of the earth on the moving particle needs 

contact. But this question is better discussed in 
connexion with Newton’s generalization. 

Newton applied Galileo’s method to the explanation of celestial 
motions. The material on which he based his deductions was 
scanty indeed; for at that time only six planets (including the 
earth) and a few satellites of these were known. I say ‘ deduc¬ 
tionsfor the essential induction had already been made by 
Kepler when he announced his three laws of planetary motion as 
vahd for planets in general. The first two laws, concerning the 
elliptic shape of the orbit and the increase of the area swept by 
the radius vector, were based mainly on Tycho Brahe’s observa¬ 
tions of Mars, i.e. of one single planet. Generalized by a sweeping 
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induction to any planet they are, according to Newton, equiva¬ 
lent to the statement that the acceleration is always directed 
towards the sun and varies inversely as the square of the distance 
r from the sun, where ya is a constant which may differ 

from planet to planet. But it is the third law which reveals the 
causal relation to the sun. It says that the ratio of the square 
of the j)eriod and the cube of the principal axis is the same for 
all planets—induced from data about the six known planets. 
This implies, as Newton showed (see Appendix, 2), that the 
constant ya is the same for all planets. Hence as in Galileo’s case, 
it can depend only on the single other body involved, the sun. 
In this way the interpretation is obtained that the centripetal 
acceleration {xr~^ is ‘due to the sun’, or ‘caused by the sun’, or 
‘a force exerted by the sun’. 

The moon and the other planetary satellites were then the 
material for the induction which led to the generalization of a 
mutual attraction of all bodies towards one another. The most 
amazing step, rightly admired by Newton’s contemporaries and 
later generations, was the inclusion of terrestrial bodies in the 
law derived from the heavens. This is in fact the idea symbolized 
by the apocryphal story of the falling apple: terrestrial gravity 
was regarded by Newton as identical with celestial attraction. 
By applying his laws of motion to the system earth-moon, he 
could calculate Galileo’s constant of gravity g from geodetical 
and astronomical data: namely, 

^ ^. 22^2 ’ (^*^) 

where r is the radius of the earth. It the distance between the 
centres of earth and moon, and T the time of revolution of 
the moon (sidereal month). 

The general equations for the motion of n particles under 
mutual gravitation read in modern vector notation 

(3.4) 

tl 



5 


(3.5) 
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wliere is the position vector of the particle a (a = 1, 2,,..,n), 
—r^l the distance of two particles a and p, V the 
potential of the gravitational forces. 

Newton also succeeded in generahzing the laws of motion for 
other non-gravitational forces by introducing the notion of mass, 
or more precisely of inertial mass. Newton’s method of repre¬ 
senting his results in an axiomatic form does not reveal the way 
he obtained them. It is, however, possible to regard this step as 
a case of ordinary causality derived by induction. One has to 
observe the acceleration of different particles produced by the 
same non-gravitational (say elastic) forces at the same point of 
space; they are found to differ, but not in direction, only in 
magnitude. Therefore, one can infer by induction the existence 
of a scalar factor characteristic for the resistance of a particle 
against acceleration or its inertia. This factor is called ‘mass’. 
It may still depend on velocity as is assumed in modern theory 
of relativity. This can be checked by experiment, and as in 
Newton’s time no such effect could be observed, the mass was 
regarded as a constant. 

Then the generalizedr'^uations of motions read 

= (3.6) 

where (a = 2 ,..., n) are the masses and the force vectors 

which depend on the mutual distances of all the particles. 
As in the case of gravitation, they may be derivable from a 
potential V by the operation 

F^=-grad^F, (3.7) 

where F is a function of the The most general form of F for 
forces inverse to the square of the distance would be (2' means 

od.jS 

summing over aU a, except a = jS) 

V=y'^, (3.8) 

where are constants; comparison with (3.4) and (3.5) shows, 
however, that these must have the form 




(3.9) 
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Newton applies further a law of symmetry, stated axiomatic- 

namely that action equals reaction, or from 

^ilich follows foifw 

(3.10) 


"^llere jfiT is a universal constant, the constant of gravitation, 
"^^^nce the constants of attraction or gravitational masses fx^ 
proportional to the inertial masses 
Neither Newton himself, nor many generations of physicists 
^^d astronomers after him have paid much attention to the law 
^^pressed by (3.10). Astronomical observations left little doubt 
^^at it was correct, and it was proved by terrestrial observations 
(^ith suitable pendulums) to hold with extreme accuracy 
(^otvos and others). Two centuries went by before Einstein saw 
fundamental problem contained in the simple equation 
C^.IO), and built on it the colossal structure of his theory of 
S^neral relativity, to which we have to return later. 

But this is not our concern here. We have to examine 
^^^ewton’s equations jfrom the standpoint of the principle of 
^^tusality. I hope I have made it clear that they imply the notion 
cause exactly in the same sense as it is always used by the 
^^cperimentalist, namely signifying a verifiable dependence of 
oxie thing or another. Yet this one thing is, in Galileo’s and 
T^ewton’s theory alike, a peculiar quantity, namely an accelera- 
"bion. The peculiarity is not only that it cannot be seen or read 
fx*om a measuring tape, but that it contains the time implicitly. 
IiT fact, Newton’s equations determine the motion of a system 
in. time completely for any given initial state (position and 
N^olocity of all particles involved). In this way, " causation ’ leads 
■fco * determination ’, not as a new metaphysical principle, but as a 
j>liysical fact, like any other. However, just as in Galileo’s 
simpler case, so here the relation between two consecutive con¬ 
figurations of the system is mutual and symmetrical. This has a 
bearing on the question whether the principle of antecedence 
bolds. As this applies, according to our definition, only to the 
oause-effect relation between single events, one has to change 
tibe standpoint. Instead of considering the acceleration of one 
body to be caused by the other bodies, one considers two 
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consecutive configurations of the whole system and asks whether 
it makes sense to call the earlier one the cause of the later one. 
But it makes no sense, for the relation between the two states is 
symmetrical. One could, with the same right, call the later 
configuration the cause of the earlier one. 

The root of this symmetry is Newton’s definition of time. 
Whatever he says about the notion of time (in Principia, 
Scholium I) as a uniform flow, the use he makes of it contains 
nothing of a flow in one direction. Newton’s time is just an 
independent variable t appearing in the equations of motion, in 
such a way that if t is changed into — t, the equations remain 
the same. It follows that, if all velocities are reversed, the 
system just goes back the same way; it is completely reversible. 

Newton’s time variable t is obviously an idealization abstracted 
from simple mechanical models and astronomical observations, 
■fitting well into celestial motion, but not into ordinary experi¬ 
ence, To us it appears that fife on earth is going definitely in one 
direction, from past to future, from birth to death, and the 
perception of time in our mind is that of an irresistible and 
irreversible current. 

Another feature of Newton’s dynamics was repugnant to 
many of his contemporaries, in particular the followers of 
Descartes, whose cosmology, whatever else its shortcomings, 
satisfied the principle of contiguity, as I have called the condition 
that cause and effect should be in spatial contact. Newton’s 
forces, the quantitative expressions for causes of motion, are sup¬ 
posed to act through empty space, so that cause and effect are 
simultaneous whatever the distance. Newton himself refrained 
from entering into a metaphysical controversy and insisted that 
the facts led unambiguously to his results. Indeed, the language 
of facts was so strong that they silenced the philosophical objec¬ 
tions, and only when new facts revealed to a later generation the 
propagation of forces wdth finite velocity, was the problem of 
contiguity in gravitation taken up. In spite of these difficulties, 
Newton’s dynamics has served many generations of physicists 
and is useful, even indispensable, to-day. 



IV 


CONTIGUITY 

MECHANICS OF CONTINUOUS MEDIA 

Although I maintain that neither causality itself nor its 
attributes, which I called the principles of antecedence and 
contiguity, are metaphysical, and that only the inference by 
induction transcends experience, there is no doubt that these 
ideas have a strong power over the human mind, and we have 
evidence enough that they have influenced the development of 
classical physics. Much effort has been made to reconcile 
Newton’s laws with these postulates. Contiguity is closely bound 
up with the introduction of contact forces, pressures, tensions, 
first in ordinary material bodies, then in the electromagnetic 
ether, and thus to the idea of fields of forces; but the systematic 
application of contiguity to gravitation exploded Newton’s 
theory, which was superseded by Einstein’s relativity. Similar 
was the fate of the postulate of antecedence; it is closely boimd 
up with irreversibility in time, and found its first quantitative 
formulation in thermodynamics. The reconciliation of it with 
Newton’s laws was attempted by atomistics and physical 
statistics; the idea being that accumulations of immense num¬ 
bers of invisible Newtonian particles, atoms, or molecules appear 
to the observer to have the feature of irreversibility for statistical 
reasons. The atoms were first hypothetical, but soon they were 
taken seriously, and one began to search for them, with increas¬ 
ing success. They became more and more real, and finally even 
visible. And then it turned out that they were no Newtonian 
particles at all. Whereupon the whole classical physics exploded, 
to be replaced by quantum theory. Looked at from the point 
of view of our principles, the situation in quantum theory is 
reversed. Determinism (which is so prominent a characteristic 
of Newton’s theory) is abandoned, but contiguity and ante¬ 
cedence (violated by Newton’s laws) are preserved to a consider¬ 
able degree. Causality, which in my formulation is independent 
of antecedence and contiguity, is not affected by these changes: 

5131 ri 
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scientific work will always be the search for causal interdepen¬ 
dence of phenomena. 

After this summary of the following discussion, let us return to 
the question why violation of the principles of contiguity and ante¬ 
cedence in Newton’s theory was first accepted—^though not with¬ 
out protest—^but later amended and finally rejected. This change 
was due to the transition from celestial to terrestrial mechanics. 

The success of Newton’s theory was mainly in the field of 
planetary motion, and there it was overwhelming indeed. It is 
not my purpose to expand on the history of astronomy after 
Newton; it sufiSlces to remember that the power of analytical 
mechanics to describe and predict accurately the observations 
led many to the conviction that it was the final formulation of 
the ultimate laws of nature. 

The main attention was paid to the mathematical investiga¬ 
tion of the equations of motion, and the works of Lagrange, 
Laplace, Gauss, Hamilton, and many others are a lasting 
memorial of this epoch. Of all these writings, I shall dwell only 
for a moment on that of Hamilton, because his formulation of 
Newton’s laws is the most general and elegant one, and because 
they will be used over and over again in the following lectures. 
So permit me a short mathematical interlude which has nothing 
directly to do with cause and chance. 


Hamilton considers a system of particles described by any 
(in general non-Cartesian) coordinates then the poten¬ 

tial energy is a function of these, V{<q^, ^ 2 ,...) or shortly V{q), and 
the kinetic energy T a function of both the coordinates and the 
generalized velocities q^,..., T{q, q). He then defines general¬ 
ized momenta ^rn 

ol 


and regards the total energy T-f- F as a function of the q^ and 
ThisfuBction T+V = H(a,p) (4.2) 

is to-day called the Hamiltonian. 

The equations of motion assume the simple "canonical’ form 

dH . dH 

9.0. — o„ ’ VoL — ——> (4.3) 
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from which one roads at once tht^ cotiHervation law of energy, 

^HL sr- 0, H — const. (4.4) 

at 


It is this sot. of formulae which has survived the most vdolont 
revolution of physical itioas whi(;h has ovt'r taken place, the 
transit ion to (}uant.uin nii<H*ha,ni(‘.s. 

R('turning now to the* post-Nc'-wtonian perio<l th('re wa-s, 
siinult aiu‘oxis with t lu' ast.ronoiuical a.pplicat.ions and coidinna- 
tions of t lu' tlu'ory, a. lively interest, in applying it t-o ordinary 
t<*rn*st rial fihysics. Rv<»n her(‘ N(‘wt.on ha<i shown the wa.y a.nd 
had eaiculatt'd, for instaiua*, the* v(‘loc.ity of sound in a lluid. 
Kvtmtuaily the nu*t‘hani(^s oj’ elastic? solids brought. a.hout a 


moditication of Nc'wton's detifiit ion of force? whic?h sal.isfic^H con¬ 


tiguity. Much of this work is due? to t.hc? gnsit uiathe'ma.t.ician 
(’auchy, Hc‘ start(*d, as niany heddre? him, hy t rc'at.ing a solid as 
an aggrc'gatc' of tiny part.ieJc's, aeding on one* a,noth<?r with 

Newtonian noncont iguous force's of short, range* .anticipating 

to some* de‘gr(*e* tlie* niodt'i n atomistie? st.a-ndpoint.. But. theire* was 
of c*ourse\ at that, tiiue*. no <*vide‘neu* of t.he* physie?al regality of 
the^se* particd(‘s. In t he* physical apj)licat.ions rdl t,ra,e?e‘s e)f them 
we're* ohlit<‘rat(*d hy ave‘raging. '‘fhe? form of t,he‘S(? re?Hult,H sag- 
g«‘Hte‘d to ('auchy anot h<*r ine‘t hod ed'appre>a(*h wh(‘re‘ pa.rt.icle* 
m<‘chanics is e*oinpl«‘te»l 3 ‘ eiise'arde'd. Mat.t,e*r is e'onsiek'rcMl to he? a 
re*al e*oiitinuurn in the* ma,tlu*mat.i(*al se'iise*, so t.hat it. has a 
nu'aning to sp(‘ak of a fore'e* h(‘t,w<*(*n t.we? pie‘cu*s e)f matt.ca* 
se'paratc'd hy a surfa(‘e*. 'Fhis M<'<*ms t.o he*, from our mexlcrn 
standpoint', a st<‘p in the* wrong din‘(dion, as we* know ma4,tiC?r 
t.o he* discontinuous. But. ('aaiediy’s work sheave'el how <',on- 
tiguity e’oulel be* int r<»du(?<'<l int.o iiH*chani(‘H ; t.he^ importaiua? of 
this peiint. be*camc' cvi(Ie*nt whe‘n t he* ne*w m<*ihod was applie*d to 
the* c‘th<*r, the* carri<*r f>f light and of (*le‘(!t.ric and magiu'tie? for(?e.*!H. 
which <*ve*n to elay is st ill n'gareh'd as (’.ont.ituuaiH though it has 
lost, most of t.he* ehanmt.e'rist.ic. prop(*rt.ie'H e)f a, HuhHta.n^u^ and <?a.n 
hardly he* e'alle*<l a- <?ont.inuous un'diam. 

In this t.Iu'ory all laws appe*ar in t.he* form of part-ial elilfc^rcuitial 
c'epmtiems, in wliiedi t.he* t.hres* space?■(!oordina.te‘K appe*a,r tejged.hcr 
with the? t.ime* as inde‘pe?u<Ii*.nt, coe)rdina.t<*s. 
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I shall give a short sketch of the mechanics of continuotis 
media. 

Mass, velocity, and all other properties of matter are con¬ 
sidered continuously distributed in space. The mass per unit 
volume or density p is then a function of the space coordinates, 
and the same holds for the current of mass u = pv (namely the 
quantity of mass passing through a surface per unit area and 
unit time). The conservation (indestructibility) of mass then 
leads to the so-called continuity equation (see Appendix, 3) 

/5-f-div u = 0. (4.5) 

Concerning the forces, one has to assume that, if the sub¬ 
stance is regarded as separated into two parts by a surface, each 
part exerts a push or pull through this surface on the other which, 
measured per unit area, is called tension or stress. A simple 
mathematical consideration, based on the equilibrium conditions 
for the resultant forces acting on the surfaces of a volume 
element, shows that it suffices to define these tension forces for 
three non-coplanar surface elements, say those parallel to the 
three coordinate planes; the force on the element normal to x 
being Tp with components T^y, the other two forces 
correspondingly Ty {Ty,, Tyy, Ty,) and % {T,,, T,y, %,), Then the 
force on a surface element with the normal unit vector 


is given by (4.6) 

Application of the law of moments to a small volume element 
shows (see Appendix, 3) that 


T 


T 

-^zyi 


T 

■*-ZX 


'Vz -^zyi ■‘■zx -^xz’ ^xy '^yx' 

Hence the quantities T form a symmetrical matrix, the stress 
tensor 


T 

•^xx 
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•^xy 
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(4.8) 

T 

•zx 
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■^zy 
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ISTewton’s law applied 
equations 


to a volume element 


drv 


= divT, 


then leads to the 


(4.9) 
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where div T is a vector with the components 


(div T) 


dT„ 


vex 


X 


8x 


dT 

j __XV 


dT. 


xz 


dx 


dx 


(4.10) 


and djdt the operator 


dt 






(4.11) 


which is called the 'convective derivative’. 

(4.9) together with (4.5) are the new equations of motion 
which satisfy the postulate of contiguity. They are the proto¬ 
type for all subsequent field theories. In the present form they 
are still incomplete and rather void of meaning, as the stress 
tensor is not specified in its dependence on the physical condi¬ 
tions of the system—^just in the same way as Newton’s equations 
are void of meaning if the forces are not specified with their 
dependence on the configuration of the particles. The configura¬ 
tion of a continuous system cannot be described by the values 
of a finite number of variables, but by certain space functions, 
called 'strain-components’. They are defined in this way: 
A small (infinitesimal) volume of initially spherical shape will be 
transformed by the deformation into an ellipsoid; the equation 
of this has the form 

6110:^-1-6222/^+^33 ^^+2623 2/21-1-2c3i2:a;~-|-2ei2 iry = €, (4.12) 

where e is an (infinitesimal) constant, measuring the absolute 
dimensions, and C 22 j *--5 <^12 quantities depending on 

the position x, y, z of the centre of the sphere. These are the 
components of the strain tensor e. 

In the theory of elasticity it is assumed that the stress com¬ 
ponents are linear functions of the strain components 
(Hooke’s law). 

In hydrodynamics the relation between T and e involves 
space- and time-derivatives of In plastic solids the situation 
is still more complicated. 

We need not enter into these different branches of the 
mechanics in continuous media. The only important point for 
us is this: Contact forces spread not instantaneously but with 
finite velocity. This is the main feature distinguishing Cauchy’s 
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contiguous mechanics from Newton’s non-contiguous. The 
simplest example is an elastic fluid (liquid or gas). Here the 
stress tensor T has only diagonal elements which are equal and 
represent the pressure ^29. The configuration can also be described 
by one variable, the density p or, for a given mass, the volume F. 
The relation between^ andF may be any function p ==/(F)—^we 
shall have to remember this later when we have to deal with 
thermod57namios. For small disturbances of equihbrium the 
general equations reduce to linear ones; any quantity ^ in an 
isotropic fluid (change of volume or pressure) satisfies the linear 
wave equation ^ 

^2 (4.13) 

where A is Laplace’s differential operator 

02 02 02 

A = divgrad = ^ + _ + _, (4.14) 

and c a constant which is easily found to mean the phase 
velocity of a plane harmonic wave 

^ ~ u4sin-^ {x — ct). 

A 

The equation (4.13) links up mechanics with other branches 
of physics which have independently developed, optics and 
electromagnetism. 

ELECTROMAGNETIC FIELDS 

The history of optics, in particular Newton’s contributions 
and his dispute with Huygens about the corpuscular or wave 
nature of light, is so well known that I need not speak about it. 
A hundred years after Newton, the wave nature of light was 
established by Young and Fresnel with the help of experiments 
on diffraction and interference. W^ave equations of the type 
(4.13) were used as a matter of course to describe the observa¬ 
tions, where now ^ means the amplitude of the vibration. 

Hut what is it that vibrates ? A name, ‘ether,’ was ready to 
hand, and its ability to propagate transverse waves suggested 
that it was comparable to an elastic solid. In this way it came 
to pass that the ether-flGlled vacuum was the carrier of contact 
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forces, spreading with finite velocity. They existed for a long 
period peacefully beside Newton’s instantaneous forces of 
gravitation, and other similar forces introduced to describe 
elementary experiences in electricity and magnetism. These 
forces are usually connected with Coulomb’s name, who verified 
them by direct measurements of the intensity of attraction and 
repulsion between small charged bodies, and between the 
poles of needle-shaped magnets. He found a law of the same 
type as that of Newton, of the form where the constant p, 

depends on the state of electrification or magnetization respec¬ 
tively of the interacting particles; by applying the law of 
action and reaction /x can be split into factors, fj, — in the 
electric case, where e^, eg are called the charges. It must, 
however, have been remarked that this law was already 
established earlier and with a higher degree of accuracy by 
Cavendish and Priestley by an indirect reasoning, with the help 
of the fact that a closed conductor screens a charged particle 
from the influence of outside charges; this argument, though still 
dressed up in the language of Newtonian forces, is already quite 
close to notions of field theory. 

It was the attempt to formulate the mechanical interactions 
between linear currents (in thin wires) in terms of Newtonian 
forces which entangled physics in the first part of the nineteenth 
century in serious difficulties. Meanwhile Paraday had begun 
his investigations unbiased by any mathematical theory, and 
accumulated direct evidence for understanding electric and 
magnetic phenomena with the help of contact forces. He spoke 
about pressures and tensions in the media surrounding charged 
bodies, using the expressions introduced in the theory of elas- 
ticity, yet with considerable and somewhat strange modifica¬ 
tions. Indeed, the strangeness of these assumptions made it 
difficult for his learned contemporaries to accept his ideas and 
to discard the well-established Newtonian fashion of descrip¬ 
tion. Yet seen from our modern standpoint, there is no intrinsic 
difference between the two methods, as long as only static and 
stationary phenomena are considered. Mathematical analysis 
shows that the resultant forces on observable bodies can be 
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expressed either as integrals over elementary contributions of 
the Newtonian—or better Coulombian—^type acting over the 
distance, or by surface integrals of tensions derived from field 
equations. This holds not only for conductors in vacuo^ but also 
for dielectric and magnetizable substances ; it is true that in the 
latter case the Coulombian forces lead to integral equations 
which are somewhat involved, but the differential equations of 
the field are, in spite of their simpler aspect, not intrinsically 
simpler. This is often overlooked in modern text-books. How¬ 
ever, in Faraday’s time this equivalence of differential and 
integral equations for the forces was not known, and if it had 
been, Faraday would not have cared. His conviction of the 
superiority of contact forces over Coulombian forces rested on 
his physical intuition. It needed another, more mathematically- 
minded genius. Clerk Maxwell, to find the clue which made it 
impossible to accept forces acting instantaneously over finite 
distances: the finite velocity of propagation. It is not easy to 
analyse exactly the epistemological and experimental founda¬ 
tions of Maxwell’s prediction, as his first papers make use of 
rather weird models and the purity of his thought appears only 
in his later publications. I think the process which led to Max¬ 
well’s equations, stripped of all unnecessary verbiage and round¬ 
about ways, was this: By combining all the known experimental 
facts about charges, magnetic poles, currents, and the forces 
between them, he could establish a set of field equations con¬ 
necting the spatial and temporal changes of the electric and 
magnetic field strength (force per unit charge) with the electric 
charge density and current. But if these were combined with 
the condition that any change of charge could occur only by 
means of a current (expressed by a continuity equation analo¬ 
gous to (4.5); see Appendix, 4), an inadequacy became obvious. 
In the language of that time, the result was formulated by 
saying that no open currents (like discharge of condensers) 
could be described by this theory. Therefore something was 
wrong in the equations, and an inspection showed a suspicious 
feature, a lack of symmetry. The terms expressing Faraday’s 
induction law (production of electric force by the time variation 
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of the magnetic field) had no counterpart obtained by exchanging 
the symbols for electric and magnetic quantities (production of 
magnetic force by the time variation of the electric field). With¬ 
out any direct experimental evidence Maxwell postulated this 
inverse effect and added to his equations the corresponding 
term, which expresses that a change of the electric field (dis¬ 
placement current) is, in its magnetic action, equivalent to an 
ordinary current. It was a guess based on a belief in harmony. 
Yet by some mathematical reasoning it can be connected with 
one single but highly significant fact which sufficed to convince 
Maxwell of the correctness of his conjecture—just as Newton 
was convinced of the correctness of his law of gravitation by one 
single numerical coincidence, the calculation of terrestrial 
gravity from the moon’s orbit. Maxwell showed that his modified 
equations had solutions representing waves, the velocity, c, of 
which could be expressed in terms of purely electric and magnetic 
constants; for the vacuum c turned out to be equal to the ratio 
of a unit of charge measured electrostatically (by Coulomb’s 
law) and electromagnetically (by Oersted’s law). This ratio, a 
quantity of the dimensions of a velocity, was known from 
measurements by Kohlrausch and Weber, and its numerical 
value coincided with the velocity of light. That could hardly 
be accidental, indeed, and Maxwell could pronounce the electro¬ 
magnetic theory of light. 

The final confirmation of Maxwell’s theory was, after his 
death, obtained by Hertz’s discovery of electromagnetic waves. 

I cannot follow the further course of events in the establish¬ 
ment of electromagnetic theory. I only wish to stress the point 
that the use of contact forces and field equations, i.e. the 
establishment of contiguity, in electromagnetism was the result 
of a long struggle against preconceptions of Newtonian origin. 
This confirms my view that the question of contiguity is not a 
metaphysical one, but an empirical one. 

We have now to see whether the laws of electromagnetism 
satisfy the principle of antecedence. An inspection of Maxwell’s 
equations (see Appendix, 4) shows that a reversal of time, 
i —> —i, leaves everything, including the continuity equation, 
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Tinclianged, if tlie electric density and field are kept unchanged 
while the electric current and magnetic field are reversed- This 
is a kind of reversibility very similar to that of mechanics, 
where a change of the sign of aU velocities makes the system 
return to its initial state. The difference is only a practical one: 
a cha-nge of sign of all current densities and the whole magnetic 
field is not as simple to perform as that of a finite set of velocities. 
The situation is best seen by considering an electromagnetic 
wave spreading fi?om a point source; the corresponding solution 
of Maxwell s equation is given by so-called retarded potentials 
wMoh express the electromagnetic state at a point JP for the time 
t in terms of the motion of the source at the time t —^/c, where t 
is the distance of P from the source. But there also exist other 
solutions, advanced potentials, which refer to the later time 
and represent a wave contracting towards the source. 

Such contracting waves are of course necessary for solving 
certain problems. Imagine, for instance, a spherical wave 
reflected by a concentric spherical mirror. However, such a 
muTor must be absolutely perfect to do its duty, and there 
appears to be something improbable about the occurrence of 
advanced potentials in nature. For the description of ele¬ 
mentary processes of emission of atoms or electrons one has 
supplemented Maxwell’s equations by the rule that only retarded 
solutions are allowed. In this way a kind of irreversibility can 
e introduced and the principle of antecedence satisfied. But 
, together artificial and unsatisfactory. The irreversi¬ 

bility of actual electromagnetic processes has its roots in other 
tacts which we shaU later have to describe in detail. Maxwell’s 
equations themselves do not satisfy the postulate of antecedence 


relativity and the eield theory of gravitation 

fo^nd reached is that which I 

SttZr Newton’s Lchanic: 

l„tt MaxweU’s electrodynamics, the 

latter two satisfymg the postulate of contiguity. Of these theories 
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Maxwell’s seemed to be the most promising and fertile, and the 
idea began to spread that possibly all forces of nature might be of 
electromagnetic origin. The problem had to b© envisaged, how to 
reconcile Newton’s gravitational forces with the postulate of con¬ 
tiguity; the solution was Einstein’s general theory of rchitivity. 

This is a long and interesting story by itself which involves 
not only the notion of cause with which we aro concermvl Iku’o, 
but other philosophical concepts, namely those concerned with 
space and time. A detailed discussion of these problems would 
lead us too far away from our subject, and I think it hardly 
necessary to dwell on them because relativity is to-da,y widely 
known and part of the syllabus of the student of inathematios 
and physics as well. So I shall give a very short outline only. 

The physical problems which led to the tlieory of I'clativity 
were those concerned with the optical and electromagnetic 
phenomena of fast-moving bodies. There arc two types of 
experiments: those using the high velocity of celestial bodies 
(e.g. Michelson’s and Morley’s experiment) and those tising fast 
electrons or ions (e.g. Bucherer’s measurement of the mass of 
electrons in cathode rays as a function of tho velocity). l''h<‘. work 
of Eoientz, EitzOerald, Poincare, and others prepared the ground 
for Einstein’s discovery that the root of all diffimdties was the 


assumption of a universal time valid for all moving systtunH of 
reference. He showed that this assumption has no foumlation 


in any possible experience and ho replaced it by a simple dc^hni- 
tion of relative time, valid in a given coordinatt^ system, but 
different from the time of another system in r<^hitiv(b motion. 
The formal law of transformation from one sjiacxvtitne systtun 
to another was already known, owijig to an analysis of honuvtz; 
it is in fact an intrinsic property of MaxwolFs ociuations. Tho 
Lorentz transformation is linear; it expresses th (3 physical 

equivalence of systems in relative motion witli constant velocit y 
(see Appendix, 5). 


Einstein’s theory of gravitation is formally based on a gem u-al- 
ization of these transformations into arbitrary, non-liiuvir otum ; 
with the help of these one can express tho transition from one 
system of reference to another accelerated (and simultaneously 
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deformed) one. The physical idea behind this mathematical 
formalism has been already mentioned: the exact proportion¬ 
ality of mass, as defined by inertia, and of mass as defined by 
gravitation, equation (3.10); or, in other words, the fact that 
in Newton’s law of gravitational motion (3.4) the (inertial) mass 
does not appear. 

Einstein succeeded in establishing equations for the gravita¬ 
tional field by identifying the components of this field with the 
quantities which define the geometry of space-time, namely 
the coefficients of the line element 

ds^ = (4.15) 

where x^, x^, stand for the space coordinates x, z, x^ for the 
time t. 

In ordinary 3-dimensional Euclidean geometry the g^^ are 
constant and can, by a proper choice of the coordinate system, 
be normalized in such a way that 

= 0 for /X V. 

Minkowski showed that special relativity can be regarded as 
a 4-dimensional geometry, where time is added as the fourth 
coordinate, but still with constant which can be normalized to 

9^11 = ^^22 = 9zz = 5^44 = ^ iov IJL V. (4.16) 

It was further known from the work of Eiemann that a very 
general type of non-Euclidean geometry in 3-dimensional space 
could be obtained by taking the g^^ as variable functions of 
x^, x^, and the mathematical properties of this geometry had 
been thoroughly studied (Levi-Civita, Ricci). 

Einstein generalized Eiemann’s formalism to four dimensions, 
assuming that the g^^ depend not only on x^, x^, x^, but also on 
x^, the time. However, he regarded the g^^ not as given functions 
of x^, x^, x^, x^ but as field quantities to be calculated from the 
distribution of matter. He formed a set of quantities which 
can be regarded as a measure of the ‘curvature’ of space and are 
functions of the g^^ and their first and second derivatives, and 
postulated equations of the form 

^fJiV (4.17) 
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where ac is a constant and the are generalizations of the 
tensions in matter, defined in (4.8): one has to supplement the 
tensor T by a fourth row and column, whej?e !7^4 are 

the components of the density of momentum, the density of 
energy. These equations (4.17) are invariant in a very general 
sense, namely for all continuous transformations of space-time, 
and they are essentially uniquely determined by this property 
and the postulate that no higher derivatives than the second- 
order ones should appear. 

If the distribution of matter is given, i.e. the are known, 
the field equations (4.17) allow one to calculate the i.e. the 
geometry of space. Einstein found the solution for a mass point 
as source of the field, and by assuming that the motion of another 
particle was determined by a geodesic, or shortest, or straightest 
line in this geometry, he showed that Newton’s laws of planetary 
motion follow as a first approximation. But higher approxima¬ 
tions lead to small deviations, some of which can be observed. 
I cannot enter into the discussion of all the consequences of the 
new gravitational theory; Einstein’s predictions have been con¬ 
firmed, although some of them are at the limit of observational 
technique. But I wish to add a remark about a theoretical point 
which is not so well known, yet very important. The assump¬ 
tion that the motion of a particle is given by a geodesic is 
obviously an unsatisfactory feature ; one would expect that the 
field equations alone should determine not only the field pro¬ 
duced by particles but also the reaction of the particles to the 
field, that is their motion. Einstein, with his collaborators 
Infeld and Hoffmann, has proved that this is in fact the case, 
and the same result has been obtained index^endently and, as I 
think, in a considerably simpler way, by the Bussian physicist, 
Fock. On the basis of these admirable pajjers, one can say that 
the field theory of gravitation is logically perfect—whether it 
will stand all observational tests remains to be seen. 

From the standpoint of the philosophical problem, which is 
the subject of these lectures, there are several conclusions to be 
drawn. The first is that now physical geometry, that is, not some 
abstract mathematical system but the geometrical aspect of the 
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beliaviour of actual bodies, is subject to the cause—effect relation, 
and to all related principles like any other branch of science. 
The mathematicians often stress the opposite point of view; 
they speak of the geometrization of physics, but though it cannot 
be denied that the mathematical beauty of this method has 
inspired numerous valuable investigations, it seems to me an 
over-estimation of the formalism. The main point is that 
Einstein’s geometrical mechanics or mechanical geometry 
satisfies the principle of contiguity. On the other hand, ante¬ 
cedence, applied to two consecutive configurations as cause and 
effect, is not satisfied, or not more than in electrodynamics; for 
there is no intrinsic direction in the flow of time contained in 
the equations. The theory is deterministic, at least in principle; 
the future or past motion of particles and the distribution of the 
gravitational field are predictable from the equations, if the 
situation at a given time is known, together with boundary 
conditions (vanishing of field at infinity) for all times. But as 
the gravitational field travels between the particles with finite 
velocity, this statement is not identical with ISTewtonian deter¬ 
minism : a knowledge is needed, not only of all particles, but also 
of all gravitational waves (which do not exist in Newton’s 
theory). Einstein himself values the deterministic feature of his 
theory very highly. He regards it as a postulate which has to 
be demanded from any physical theory, and he rejects, there¬ 
fore, parts of modern physics which do not satisfy it. 

Here I only wish to remark that determinism in field theories 
seems to me of very Httle significance. To illustrate the power 
of mechanics, Laplace invented a super-mathematician able to 
predict the future of the world provided the positions and veloci¬ 
ties of aU particles at one moment were given to him. I can 
sympathize with him in his arduous task. But I would really 
pity him if he had not only to solve the numerous ordinary 
differential equations of Newtonian type but also the partial 

differential equations of the field theory with the particles as 
singularities. 
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We have now to discuss the experiences which make it possible 
to distinguish in an objective empirical way between past and 
future or, in our terminology, to establish the principle of 
antecedence in the chain of cause and elFect. These experiences 
are connected with the production and transfer of heat. There 
would be a long story to tell about the preliminary steps 
necessary to translate the subjective phenomena of hot and cold 
into the objective language of physics: the distinction between 
the quality ‘temperature’ and the quantity ‘heat’, together 
with the invention of the corresponding instruments, the ther¬ 
mometer and calorimeter. I take the technical side of this 
development to bo well known and I shall use the thermal 
concepts in the usual way, although I shall have to analyse them 
presently from the standpoint of scientific methodology. It was 
only natural that the measurable quantity heat was first 
regarded as a kind of invisible substance called caloric. The 
flow of heat was treated with the methods developed for material 
liquids, yet with one important difference; the inertia of the 
caloric fluid seemed to be negligible; its flow was determined 
by a differential equation which is not of the second but of the 
first order in time. It is obtained from the continuity equation 

(see (4.5)) 0+divq = O (5.1) 

by assuming that the change of the density of heat Q is propor¬ 
tional to the change of temperature T, $Q == c ST (where c is the 
specific heat), while the current of heat q is proportional to the 
negative gradient of temperature, q — —k grad T (where k is 
the coefficient of conductivity). Hence 

dT 

(5.2) 

a differential equation of the first order in time. This equa¬ 
tion was the starting-point of one of the greatest discoveries 
in mathematics, Fourier’s theory of expansion of arbitrary 
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functions in. terms of ortliogonalsets of simpl© periodic functions, 
the prototype of numerous similar expansions and the embryo 
from which a considerable part of modern analysis and mathe¬ 
matical physics developed. 

But that is not the aspect from which we have hei*o to legaid 
the equation (5.2); it is this: 

The equation does not allow a change of t into t, the lesult 
cannot be compensated by a change of sign of other varial)leH 
as happens in Maxwell’s equations. Hence the solutions exhibit 
an essential difference of past and future, a definite fiow" of time 
as one is used to say—meaning, of course, a flow of events in 
time. Bor instance, an elementary solution of (5.2) foi* the 
temperature distribution in a thin wire along the rr-dircction is 


T-T^^ 


It 



which describes the spreading and levelling out of an initially 
high temperature concentrated near the point a: ■ 0, an 
obviously irreversible phenomenon. 

I do not know enough of the history of physics to uiiderstiind 
how this theory of heat conduction was reconciled with t.!u^ 
general conviction that the ultimate laws of physics wtu'c of tlu^ 
Newtonian reversible type. 

Before a solution of this problem could be attempted aaiotluvr 
important step was necessary: the discovery of the e(|ui\^a,Um(;e 
of heat and mechanical work, or, as we say to-day, of the lirsl» 
law of thermodynamics. It is important to remembei* t»l ia.tr this 
discovery was made considerably later than the invention of thes 
steam-engine. Not only the production of heat by mechanical 
work (e.g. through friction), but also the jjroduction of work 
from heat (steam-engine) was known. The new featui'c* wjim trhe 
statement that a given amount of heat always corresponds to a 
definite amount of mechanical work, its ‘mechanical equivalent’. 
Bobert Mayer pronounced this law on very scanty and indirect 
evidence, but obtained a fairly good value for the equivalent 
from known properties of gases, namely from the differenci^ of 
heat necessary to raise the temperature by on© degree if either 
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i>lie Yolume is kept constant or the gas allowed to do work 
a^gainst a constant pressure. Joule investigated the same 
j>3:*oblein by systematic experiments which proved the essential 
jpoint, namely that the work necessary to transfer a system from 
one equihbrium state to another depends only on these two 
shates, not on the process of application of the work. This is the 
real content of the first law; the determination of the numerical 
-v^alue of the mechanical equivalent, so much stressed in text- 
lb ooks, is a matter of physical technique. To get our notions 
olear, we have now to return to the logical and philosophical 
ToTindations of the theory of heat. 

The problem is to transform the subjective sense impressions 
of hot and cold into objective measurable statements. The latter 
£tre, of course, again somewhere connected with sense impres¬ 
sions. You cannot read an instrument without looking at it. 
3Bxxt there is a difference between this looking at, say, a thermo- 
moter with which a nurse measures the temperature of a patient 
the feeling of being hot under which the patient suffers. 

It is a general principle of science to rid itself as much as 
^possible from sense qualities. This is often misunderstood as 
^meaning elimination of sense impressions, which, of course, is 
€blbsurd. Science is based on observation, hence on the use of the 
sonses. The problem is to eliminate the subjective features and 
-to maintain only statements which can be confirmed by several 
indLividuals in an objective way. It is impossible to explain to 
€i,nybody what I mean by saying ‘This thing is red" or ‘This 
■fcixing is hot’. The most I can do is to find out whether other 
j>€5i'sons call the same things red or hot. Science aims at a closer 
relsition between word and fact. Its method consists in finding 
oor-relations of one kind of subjective sense impressions with 
obixer kinds, using the one as indicators for the other, and in this 
wa/y establishes what is called a fact of observation. 

Here I have ventured again into metaphysics. At least, a 
p>lxilosopher would claim that a thorough study of these 
rnetihodological principles is beyond physios. I think it is again 
J 5 b mle of our craft as scientists, like the principle of inductive 
inference, and I shall not analyse it further at this moment. 

G131 r> 
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In tlie case of thermal phenomena, the problem is to define 
the quantities involved—^temperature, heat—by means of 
observable objective changes in material bodies. It tnrns ont 
that the concepts of mechanics, configuration and force, strain 
and stress, suffice for this purpose, but that the laws of mechanics 
have to be essentially changed. 

Let us consider for simplicity only systems of fluids, that is of 
continuous media, whose state in equilibrium is defined by one. 
single strain quantity, the density, instead of which we can also, 
for a given mass, take the total volume V. There is also only one 
stress quantity, the pressure p. From the standpoint of 
mechanics the pressure in equilibrium is a given function of the 
volume, p = /(F). 

Now all those experiences which are connected with the 
subjective impression of making the fluid hotter or colder, show 
that this law of mechanics is wrong: the pressure can be changed 
at constant volume—^namely ‘by heating’ or ‘by cooling’. 

Hence the pressure p can be regarded as an independent 
variable besides the volume V, and this is exactly what thermo¬ 
dynamics does. 

The generalization for more comphcated substances (such as 
those with rigidity or magnetic polarizability) is so obvious that 
I shall stick to the examples of fluids, characterized by tw^o 
thermodynamically independent variables V,p. Hut it is 
necessary to consider systems consisting of several fluids, and 

therefore one has to say a word about different kinds of contact 
between them. 

To shorten the expression, one introduces the idea of ‘walls’ 
separating different fluids. These walls are supposed to be so 
thin that they play no other part in the physical behaviour of 
the system than to define the interaction between two neigh¬ 
bouring fluids. We shaU assume every waU to be impenetrable 
a'lthough in theoretical chemistry semi-permeable 
partitions are used with great advantage. Two kinds of walls 
are to be considered. 

An adiabatic wall is defined by the property that equilibrium 

a body enclosed by it is not disturbed by any external process 
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as long as no part of the wall is moved (distance forces being 
excluded in the whole consideration). 

Two comments have to be made. The first is that the 
adiabatic property is here defined without using the notion of 
heat; that is essential, for as it is our aim to define the thermal 
concepts in mechanical terms, we cannot use them in the ele¬ 
mentary definitions. The second remark is that adiabatic 
enclosure of a system can be practically realized, as in the Dewar 
vessel or thermos flask, with a high degree of ap)proximation. 
Without this fact, thermodynamics would be utterly im¬ 
practicable. 

The ordinary presentations of this subject, though rather 
careless in their definitions, cannot avoid the assumption of the 
possibility of isolating a system thermally; without this no 
calorimeter would work and heat could not be measured. 

The second type of wall is the diathermanous wall, defined by 
the following property: if two bodies are separated by a diather¬ 
manous wall, they are not in equilibrium for arbitrary values of 
their variables Px^Vi a^i^d but only if a definite relation 

between these four quantities is satisfied 

Pipy, K. y^) = 0- (5-4) 

This is the expression of thermal contact; the wall is only intro¬ 
duced to symbolize the impossibility of exchange of material. 

The concept of temperature is based on the exjjerience that 
two bodies, being in thermal equilibrium with a third one, are 
also in thermal equilibrium with another. If w^e write (5.4) in 
the short form i^(l, 2) == 0, this property of equilibrium can be 
expressed by saying that of the three equations 

F{2, 3) = 0, F{Z, 1) = 0, F{1, 2) = 0, (5.5) 

any two always involve the third. This is only possible if 
(5.4) can be brought into the form 

tliPl* ~ f2{P2> ^)* 

ISTow one can use one of the two bodies, say 2, as thermometer 
and introduce the value of the function 


(5.7) 
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as empirical temperature. Then one has for the other body the 
so-called equation of state 

K) = (5-8) 


Any arbitrary function of ■& can be chosen as empirical tem¬ 
perature with equal right; the choice is restricted only by 
practical considerations. (It would be impractical to use a ther¬ 
mometric substance for which two distinguishable states are in 
thermal equilibrium.) The curves -O' = const, in they> F-plane are 
independent of the temperature scale; they are called isotherms. 

It is not superfluons to stress the extreme arbitrariness of the 
temperature scale. Any suitable property of any substance can 
be chosen as thermometric indicator, and if this is done, still 
the scale remains at our disposal. If we, for example, choose a 
gas at low pressures, because of the simplicity of the isothermal 
compression law pV — const., there is no reason to take pV = •& 
as measure of temperature: one could just as well take {pV)^ or 
^{pV). The defimtion of an * absolute’ scale of temperature was 
therefore an urgent problem which was solved by th e discovery 
of the second law of thermodynamics. 

The second fundamental concept of thermodynamics, that of 
heat, can be defined in terms of mechanical quantities by a 
proper interpretation of Joule’s experiments. As I have pointed 
out already, the gist of these experiments lies in the following 
fact: K a body in an adiabatic enclosure is brought from one 
(equihbrium) state to another by applying external work, the 
amount of this work is always the same in whatever form 

(mechanical, electrical, etc.) and manner (slow or fast, etc.) it is 
applied. 


Hence for a given initial state (y,,. F„) the work done adia- 

baticaUy is a fonotion U of the final state (p, V), and one can 
write 


W=U-Uo; ( 5 . 9 ) 

the Wtion U(p, V) is caUed the energy of the system. It is a 
qu^tity directly measurable by mechanical methods 

^ non-adiabatic process leading from the 

difference 

Uo W wiU not he zero, but can be determined if the energy 
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function V) is known from previous experiment. 


diiference 


U—Uo—W = Q 


This 

(5.10) 


is called the heat supplied to the system during the process. 
Equation (5,10) is the definition of heat in terms of mechanical 
quantities. 

This procedure presupposes that mechanical work is measur¬ 
able however it is applied; that means, for example, that the 
displacements of and the forces on the surface of a stirring- 
wheel in a fluid, or the current and resistance of a wire heating 
the fluid, must be registered even for the most violent reactions. 
Practically this is difficult, and one uses either stationary pro¬ 
cesses of a comparatively long duration where the irregular 
initial and final stages can be neglected (this includes heating 
by a stationary current), or extremely slow, ‘quasi-static’ 
processes; these are in general (practically) reversible, since no 
kinetic energy is produced which could be irreversibly destroyed 
by friction. In ordinary thermodynamics on© regards every 
curve in the p F-plane as the diagram of a reversible jjrocess; 
that means that one allows infinitely slow heating or cooling by 
bringing the system into thermal contact with a series of large 
heat reservoirs which differ by small amounts of temperature. 
Such an assumption is artificial; it does not even remotely 
correspond to a real experiment. It is also quite superfluous. 
We can restrict ourselves to adiabatic quasi-static processes, 
consisting of slow movements of the (adiabatic) walls. Ifor those 
the work done on a simple fluid is 


dW== -pdV, (5,11) 

where p is the equilibrium pressure, and the first theorem of 
thermodynamics (5.10) assumes the form 


dQ = dU-\-pdV == 0. (5.12) 

Eor systems of fluids separated by adiabatic or diathormanous 
walls the energy and the work done are additive (according to 
our definition of the walls); hence, for instance, 

dQ = dQ-jr-\~dQ2 dU~{~PxdVi^~\~p2^(ZV2, 

U = 14 + a. 


where 


(5.1,‘J) 
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This equation is of course only of interest for the case of 
thermal contact where the equation (5.6) holds; the system has 
then only three independent variables, for which one can choose 
TJ, TJ and the temperature d', defined by (5.7) and (5.8). Then 

= C4(TJ, d')y £4 ~ (5.13) takes the form 

(5.14) 

Every adiabatic quasi-static process can be represented as a 
fine in the three-dimensional i9’-space which satisfies this 
equation; let us call these for brevity ‘adiabatic lines’. 

Equation (5.14) is a differential equation of a type studied by 
Pfaff. Pfafiian equations are the mathematical expression of 
elementary thermal experiences, and one would expect that the 
laws of thermodynamics are connected with their properties. 
That is indeed the case, as Caratheodory has shown. Eut 
classical thermodynamics proceeded in quite a different way, 
introducing the conception of idealized thermal machines which 
transform heat into work and vice versa (WilHam Thomson— 
Lord Kelvin), or which pump heat from one reservoir into 
another (Clausius). The second law of thermodynamics is then 
derived from the assumption that not all processes of this kind 
are possible: you cannot transform heat completely into work, 
nor bring it from a state of lower temperature to one of higher 
‘without compensation’ (see Appendix, 6). These are new and 
strange conceptions, obviously borrowed from engineering. 
I have mentioned that the steam-engine existed before thermo¬ 
dynamics ; it was a matter of course at that time to use the 
notions and experiences of the engineer to obtain the laws of 
heat transformation, and the establishment of the abstract 
concepts of entropy and absolute temperature by this method 
is a wonderful achievement. It would be ridiculous to feel any¬ 
thing but admiration for the men who invented these methods. 
Put even as a student, I thought that they deviated too much 
from the ordinary methods of physics; I discussed the problem 
with my mathematical friend, Caratheodory, with the result 
that he analysed it and produced a much more satisfactory 
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solution. This was about forty years ago, but still all text¬ 
books reproduce the "classical’ method, and I am almost certain 
that the same holds for the great majority of lectures—1 know, 
however, a few exceptions, namely those of the late It. H. howler 
and his school. This state of affairs seems to me one of un h ealthy 
conservatism. I take in these lectures an oppoi'tiinity to advo¬ 
cate a change. 

The central point of Carath^odory’s method is this. Tlie 
principles from which Kelvin and Clausius derived the second 
law are formulated in such a way as to dover tlie greatest 
possible range of processes incapable of execution: in no way 
whatever can heat be completely transformed into work or 
raised to a higher level of temperature. Carathdodory rcmia.rked 
that it is perfectly sufficient to know the existence of .some 
impossible processes to derive the second law. I need hardly 
say that this is a logical advantage. Moreover, the impossibio 
processes are already obtained by scrutinizing Joule’s oxi>ori~ 
ments a little more carefully. They consisted in bi’ingiiig a. 
system in an adiabatic enclosure from one equilibi-iutn stati*. to 
another by doing external work : it is an elemontaiy ex peirioiuse, 
almost obvious, that you cannot get your work back by retvei'sing 
the process. And that holds however near tl).e two statt'^s ar<3. 
One can therefore say that there exist adiabatically ina.c*-(;essi])Ie 
states in any vicinity of a given state. That is Carat!idodory’s 
principle. 

In particular, there are neighbour states of airy given 
one which are inaccessible by quasi-static adiabatic procuNsses. 
These are represented by adiabatic lines satisfying the 
Tfaffian equation (5.14). Therefore the question a,i’is(w: Does 
Caratheodory’s postulate hold for any Tfalfian or does iii inc'an 
a restriction ? 

The latter is the case, and it can be seen by very siinplo 
mathematics indeed, of which I shall give here a shoi't skcd-ch 
(see Appendix, 7). 

Let us first consider a Pfaffian equation of two variabicss, 
X and y. 


dq = X dx-\-Y dy^ 


(5.15) 
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where X.^Y are ftmctions of x,y. This is equivalent to the 
ordinary differential equation 


dx~ T’ 


(5.16) 


which has an infinite number of solutions <j>{x),y) — const., 
representing a one-parameter set of curves in the {x, y)-plane. 
Along any of these curves one has 


d4> 


^dxJr^dy = 0 , 

dx By 


(5.17) 


and this must be the same condition as the given Pfaffian; 
hence one must have 


dQ = X dcf>. 


(5.18) 


Each PfafiS.an dQ of two variables has therefore an ‘'integrating 
denominator^ A, so that dQ/X is a total differential. 

For Pfaffians of three (or more) variables, 


dQ = X dx-\-Y dy-\-Z dz (5.19) 

this does not hold. It is easy to give analytical examples (see 
Appendix, 7); but one can see it geometrically in this way: if in 
(5.19) dx, dy, dz are regarded as finite differences ^ — x, rj — y, Z, — z, 
it is the equation of a plane through the point x, y, z', one has a 
plane through each point of space, continuously varying in 
orientation with the position of this point. Now if a function <(> 
existed, these planes would have to be tangential to the surfaces 
<f>{^>y>^) = const. But one can construct continuously varying 
sets of planes which are not ‘integrablei.e. tangential to a sc^t 
of surfaces. For example, take all circular screws with the same 
axis, but varying radius and pitch, and construct at each point 
of every screw the normal plane; these obviously form a non- 
integrable set of planes. 

Hence all Pfaf&ans can be separated into two classes: those of 
the form dQ — Xd<f>, which have an ‘integrating denominator’ 
and represent the tangential planes of a set of surfaces ^ = const., 
and those which lack this property. 

Now in the first case, dQ = Xd(f>, any line satisfying the 
Pfaffian equation (5.19) must He in the surface <f> = const. 
Hence an arbitrary pair of points Pq and P in the a-’ys-space 
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cannot be connected by such a line. This is quite elementary. 
Not quite so obvious is the inverse statement which is used in the 
thermodynamic application: If there are points P in any 
vicinity of a given point which cannot be connected with Pq 
by a line satisfying the Pfaf&an equation (5.19), then there 
exists an integrating denominator and one has dQ — Xd<f>. 

One can intuitively understand this theorem by a continuity 
consideration: All points P inaccessible from Pq will fill a certain 
volume, bound by a surface of accessible points going through Pq. 
Further, to each inaccessible point there corresponds another 
one in the opposite direction; hence the boundary surface must 
contain all accessible points: which proves the existence of the 
function (j>, so that dQ = Xd<f> (see Appendix, 7). 

The application of this theorem to thermodynamics is now 
simple. Combining it with Carath^odory’s principle, one has for 
any two systems 

dQ-^ ■= Xid<f>-j^y dQ^ = X2^d<f>2y (5.20) 

and for the combined system 

dQ — dQ2r{-dQ2, — Xdcl>; (5,21) 

hence Xd4> — X^d4>x-\-Xc^d<j>.2^. (5.22) 

Consider in particular two simple fluids in thermal contact; 
then the system has three independent variables which 

can be replaced by Then (5.22) shows that depends 

only on ^i, 9^25 while 


d<f> _Ag 

Hence these quotients are also independent of d'. 


d 



from which one infers 


d Ao 


dd' X 


o. 


(5.23) 


1 aAi _ 1 aAg _ 1 aA 

Ai dd' ~~ Ag ~X^‘ 


(5.24) 


Now Aj^ is a variable of the first fluid only, therefore only 
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dependent on 4>i ^ same way == X 2 {<f> 2 >^)- The 

first equality (5.24) can only hold if both quantities depend only 
on;^. Hence 


elogAi alogAg Slog A 


(5.25) 


where g-^) is a universal function, namely numerically identical 
for different fluids and for the combined system. 

This simple consideration leads with ordinary mathematics 
to the existence of a universal function of temperature. The rest 
is just a matter of normalization. From (5.25) one finds for each 
system 

logA= J ^W<J^+log«>, A = (5.26) 

where <I> depends on the corresponding </>. 

If one now defines 


fr(^) = ^ 

i J O(^) 


(5.27) 


where the constant G can be fixed by prescribing the value of 
T -^—^2 for two reproducible states of some normal substance 
(e.g. —Tg = 100°, if corresponds to the boiling-point, 

the freezing-point of water at 1 atmosphere of pressure), then 

one has dQ = Xd<l> = Td8. (5.28) 

T is the thermodynamical or absolute temperature and S the 
entropy. 

Equation (5.28) refers only to quasi-static processes, that 
is, to sequences of equilibrium states. To get a result about 
real dynamical phenomena one has to apply Caratheodory’s 
principle again, considering a finite transition from an initial 
state FJ, F§, to a final state 1Pj, Tg, S. One can reach the latter 
one in two steps: first changing the volume quasi-statically 
(and adiabatically) from FJ, to TJ, the entropy remaining 
constant, equal to and then changing the state adiabatically, 
but irreversibly (by stirring, etc.) at constant volume, so that 
goes over into S. 

How if any neighbouring value S of could be reached in 
this way, one would have a contradiction to Carathdodory’s 
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principle, as the volumes are of course arbitrarily changeable. 
Hence for each such process one must have either jS or 

jS ^ S^. Continuity demands that the same sign holds for all 
initial states; it holds also for different substances since the 
entropy is additive (as can be easily seen). The actual sign ^ 
or depends on the choice of the constant C in (5.27); if this is 
chosen so that T is positive, a single experience, say with a gas, 
shows that entropy never decreases. 

It may not be superfluous to add a remark on the behaviour 
of entropy for the case of conduction of heat. As thermo¬ 
dynamics has to do only with processes where the initial and 
final states are equilibria, stationary flow cannot be treated: one 
can only ask, What is the final state of two initially separated 
bodies brought into thermal contact ? The difficulty is that a 
change of entropy is only defined by quasi-static adiabatic 
processes ; the sudden change of thermal isolation into contact, 
however, is discontinuous and the processes inside the system 
not controllable. Yet one can reduce this process to the one 
considered before. By quasi-static adiabatic changes of volume 
the temperatures can be made equal without change of entropy; 
then contact can be made without discontinuity, and the initial 
volumes quasi-statically restored, again v/itliout a change of 
entropy. The situation is now the same as in tlie initial state 
considered before, and it follows that any process leading to the 
final state must increase the entropy. 

The whole chain of considerations can be generalized for more 
complicated systems without any difficulty. One has only to 
assume that all independent variables excej)t one are of the type 
represented by the volume, namely arbitrarily changeable. 

If one has to deal, as in chemistry, with substances wliich are 
mixtures of different components, one can regard the concentra¬ 
tions of these as arbitrailly variable with the help of somi- 
permeable walls and movable pistons (see Api:)endix, 8). 

By using thermodynamics a vast amount of knowledge has 
been accumulated not only in physics but in the borderland 
sciences of physico-chemistry, metallurgy, mineralogy, etc. Most 
of it refers to equilibria. In fact, the ex jjressioii' therm odynamios ’ 
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is misleading. The only dynamical statements possible a: 
concerned with the irreversible transitions from one eqnilibriti] 
state to another, and they are of a very modest character, givir 
the total increase of entropy or the decrease of free energ 
F — U — TjS. The irreversible process itself is outside the scoj 
of thermodynamics. 

The principle of antecedence is now satisfied; hut this gain ; 
paid for by the loss of all details of description which ordinar 
dynamics of continuous media supplies . 

Can this not be mended ? Why not apply the methods c 
Cauchy to thermal processes, by treating each volume elemer 
as a small thermodynamical system, and regarding not onl 
strain, stress, and energy, but also temperature and entropy a 
continuous functions in space ? This has of course been done 
but with limited success. The reason is that thermodymamics i 
definitely connected with walls or enclosures. We have used th 
adiabatic and diathermanous variety, and mentioned semi 
permeable * walls necessary for chemical separations; but j 
volume element is not surrounded by a wall, it is in free contac 
with its neighbourhood. The thermodynamic change to whicl 
it is subject depends therefore on the flux of energy and materia 
constituents through its boundary, which themselves cannot b< 
reduced to mechanics. In some limiting cases, one has founc 
simple solutions. Tor instance, when calculating the velocity o 
sound in a gas, one tried first for the relation between pressure 
p and density p the isothermal law p == cp where c is a constant 
but found no agreement with experiment; then one took th< 
adiabatic law p = cpy where y is the ratio of the specific heats ai 
constant pressure and constant volume (see Appendix, 9), whicl 
gave a much better result. The reason is that for fast vibrationj 
there is no time for heat to flow through the boundary of £ 
volume element which therefore behaves as if it were adiabati- 
cally enclosed. But by making the vibrations slower and slower 
one certainly gets into a region where this assumption does not 
hold any more. Then conduction of heat must be taken intc 
aecoymt. The hydrodymamical equations and those of heat con- 
uction have to be regarded as a simultaneous system. In this 



t, as I have never studied it; nor have the majority of physicists 
ihown much interest in this kind of thing. One knows that any 
lux of matter and energy can he fitted into Cauchy’s general 
scheme, and there is not much interest in doing it in the most 
jeneral way. Besides, each effect needs separate constants- 
).g. in hquids compressibihty, specific heat, conductivity of heat, 
ionstants of diffusion; in solids elastic constants and parameters 
iescribing plastic flow, etc., and very often these so-called con¬ 
stants turn out to be not constants, but to depend on other 


juantities (see Appendix, 10). 

Therefore one can rightly say that with ordinary thermo- 
iynamics the descriptive method of physics has come to its 
latural end. Something new had to appear. 







VI 

CHAlSrCE 

KINETIC THEOBY OF GASES 

The new ijiirn in piiysios wns tli6 introduction of citoniistics nnei 
statistics. 

To follow up the history of atomistics into the remote past is 
not in the plan of this lecture. We can take it for granted that 
since the days of Demokritos the hypothesis of matter being 
composed of ultimate and indivisible particles was familiar to 
every educated man. It was revived when the time was ripe. 
Lord Kelvin quotes frequently a Father Boscovicb as one of the 
first to use atomistic considerations to solve physical problems ; 
he lived in the eighteenth century, and there may have been 
others, of whom I know nothing, thinking on the same lines. 
The first systematic use of atomistics was made in chemistry, 
where it allowed the reduction of innumerable substances to a 
relatively small stock of elements. Physics followed consideiubly 
later because atomistics as such was of no great use without 
another fundamental idea, namely that the observable pro])ertios 
of matter are not intrinsic qualities of its smallest parts, but 
averages over distributions governed by the laws of cliance. 

The theory of probability itself, which expresses these hiws, is 
much older; it sprang not from the needs of natural science but fi’om 
gambling and other, more or less disreputable, human activities. 

The first use of probability considerations in science was made 
by Gauss in his theory of experimental errors. I can su])posc 
that every scientist knows the outlines of it, yet I have to dwell 
upon it for a few moments because of its fundamental and 
somewhat paradoxical aspect. It has a direct bearing on the 
method of inference by induction which is the backbone of all 
human experience. I have said that in my opinion the signifi¬ 
cance of this method in science consists in the establishment of a 
code of rules which form the constitution of science itself. Now 
the curious situation arises that this code of rules, which ensures 
the possibility of scientific laws, in particular of the cause-efiect 
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relation, contains besides many other prescriptions those related 
to observational errors, a branch of the theory of probability. 
This shows that the conception of chance enters into the very 
first steps of scientific activity, in virtue of the fact that no 
observation is absolutely correct. I think chance is a more 
fundamental conception than causality; for whether in a con¬ 
crete case a cause-effect relation holds or not can only be judged 
by applying the laws of chance to the observations. 

The history of science reveals a strong tendency to forget this. 
When a scientific theory is firmly established and confirmed, it 
changes its character and becomes a part of the metaphysical 
background of the age: a doctrine is transformed into a dogma. 
In fact no scientific doctrine has more than a probability value 
and is open to modification in the light of new experience. 

After this general remark, let us retuim to the question how 
the notion of chance and probability entered physics itself. 

As early as 1738 Daniel Bernoulli suggested the interpretation 
of gas pressure as the effect of the impact of numerous particles 
on the wall of the container. The actual development of the 
kinetic theory of gases was, however, accomplished much later, 
in the nineteenth century. 

The object of the theory was to explain the mechanical and 
thermodynamical properties of gas from the average behaviour 
of the molecules. For this purpose a statistical hypothesis was 
made, often called the ‘principle of molecular chaos’: for an 
‘ ideal ’ gas in a closed vessel and in absence of external forces all 
positions and all directions of velocity of the molecules are 
equally probable. 

Applied to a monatomic gas (the atoms are supposed to be 
mass points), this leads at once to a relation between volume F, 
pressure^, and mean energy JJ (see Appendix, 11) 

Vp = lU, ( 6 . 1 ) 

if the pressure p is interpreted as the total momentum trans¬ 
ferred to the wall by the impact of the molecules. One has now 
only to assume that the energy ?7 is a measure of temperature 
to obtain Boyle’s law of the isotherms. Then it follows from 
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thermodynamics that U is proportional to the absolute tempera¬ 
ture (see Appendix, 9 ) ; one has 

U == IBT, = RT, ( 6 . 2 ) 

where R is the ordinary gas constant- This is the complete 
equation of state (combined Boyle-Charles law), and one sees 
that the specific heat of a monatomic gas for constant volume 
is |jR. 

I have mentioned these things only to stress the point that the 
kinetic theory right from the beginning produced verifiable 
numerical results in abundance. There could be no doubt that 
it was right, but what did it really mean ? 

How is it possible that probability considerations can be 
superimposed upon the deterministic laws of mechanics without 
a clash? 

These laws connect the state at a time t to the initial state, 
at time Iq, by definite equations. They involve, however, no 
restriction on the initial state. This has to be determined by 
observation in every concrete case. But observations are not 
absolutely accurate; the results of measurements will suffer 
scattering according to Gauss’s rules of experimental errors. In 
the case of gas molecules, the situation is extreme; for owing 
to the smallness and excessive number of the molecules, there is 
almost perfect ignorance of the initial state. 

The only facts known are the geometrical restriction of the 
position of each molecule by the walls of the vessel, and some 
physical quantities of a crude nature, like the resultant pressure 
and the total energy: very little indeed in view of the number of 
molecules (about 10^® per c.c.). 

Hence it is legitimate to apply probability considerations to 
the initial state, for instance the hypothesis of molecular chaos. 
The statistical behaviour of any future state is then completely 
determined by the laws of mechanics. This is in particular the 
case for ‘statistical equilibrium when the observable properties 
are independent of time; in this case any later state must have 
the same statistical properties as the initial state (e.g. it must 
also satisfy the condition of molecular chaos). How can this be 
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mathematically formulated ? It is conYenient to use the equa¬ 
tions of motion in Hamilton’s canonical form (4.3, p. 18). The 
distribution is described by a function/(i, 

of all coordinates and momenta, and of time, such that fdpdq 
is the probability for finding the system at time i in a given 
element dpdq = dp^...dp^dq^...dq^. One can interpret this 
function as the density of a fluid in a 2w-dimensional jp^-space, 
called ‘ phase space ’; and, as no particles are supposed to dis¬ 
appear or to be generated, this fluid must satisfy a continuity 
equation, of the kind (4.5, p. 20), generalized for 2n dimensions, 
namely (see Appendix, 3) 


8/ I (S{fq^) . d{fp,c)\ _ „ 


(6.3) 


This reduces in virtue of the canonical equations (4.3), p. 18, 


to 




(6.4) 


where [H,/] is an abbreviation, the so-called Poisson bracket, 
namely 


iH.n 


af ZB 8f 


^ Sqk 




( 6 . 6 ) 


On the other hand, the convective derivative defined for three 
dimensions in (4.11, p. 21) may he generalized for 2re dimensions 
thus: .. sf 


dt 




qk 


^Pk 




( 6 . 6 ) 


Then (6.4) says that in virtue of the mechanical equations 

df 


dt 


0 . 


(6.7) 


The result expressed by the equivalent equations (6.4) and 
(6.7) is called Liouville’s theorem. The density function is an 
integral of the canonical equations, i.e. f — const, along any 
trajectory in phase space; in other words, the substance of the 
fluid is carried along by the motion in phase space, so that the 

/ = J fdpdq (6.8) 


integral 


5131 
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over any part of the substance moving in phase space is in* 
pendent of time. 

Any admissible distribution function, namely one for wh: 
the probabilities of a configuration at dijfferent times are co 
patible with the deterministic laws of mechanics, must be 
integral of motion, satisfying the partial differential equat: 
(6.4). For a closed system, i.e. one which is free from exter: 
disturbances (hke a gas in a sohd vessel), H is explicitly in 
pendent of time. The special case of statistical equilibri 
corresponds to certain time-independent solutions of (6.4), 
functions / satisfying 

An obvious integral of this equation is / = cl>(i5f), where ^ ir 
cates an arbitrary function. This case plays a prominent p 
in statistical mechanics. 

Yet before continuing with these very general considerati 
we had better return to the ideal gases and consider the kin< 
theory in more detail. In an ideal gas, the particles (ato 
molecules) are supposed to move independently of one anoti 
Hence the function/(p, g) is a product of N functions/(x,' 
each belonging to a single particle and all formally identi 
X is the position vector and ^ = ( 1 /m)p the velocity vector. T 
/ diz.d^ is the probabifity of finding a particle at time ^ at a speci 
element of volume and velocity. 

In the case where no external forces are present {dHjdt = 
dS.jd'K. = 0) the Hamiltoniaii reduces to the kinetic energy. 


[H,/] = 0. 




H = _ (iy'2m)p2. 

The hypothesis of molecular chaos is expressed by assuming 
be a function of alone. This is indeed a solution of (6.9), t 
can be written in the form f = <I)(H) mentioned above, 
other solution exists if the gas as a whole is homogeneous 
isotropic (i.e. all positions and directions are physically equ: 
lent; see Appendix, 12). 

The determination of the velocity distribution function j 
was recogmzed by Maxwell as a fundamental problem of kin 
theory: it is the quantitative formulation of the ‘law of cha] 
for tnis case. He gave several solutions; his first and simp 
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reasoning was this: Suppose the three components of velocity 
iif ^3 statistically independent, then 

/(?") = Mi+a+a) = (^.lo) 

This functional equation has the only solution (see Appendix, 13) 

. . ^ ( 6 . 11 ) 

where a, (3 are constants. 

This is Maxwell’s celebrated law of velocity distribution. 
However, the derivation given is objectionable, as the supposed 
independence of the velocity components is not obvious at all. 
I have mentioned it because the latest proof (and as I think the 
most satisfactory and rigorous and of the widest possible genera¬ 
lization) of the distribution formula uses exactly this Maxwellian 
argument, only applied to more suitable variables—as we shall 
presently see. 

Maxwell, being aware of this weakness, gave several other 
proofs which have been improved and modified by other authors. 
Eventually it appears that there are two main types of argu¬ 
ment: the equilibrium proof and the dynamical proof. We shall 
first consider the equilibrium proof in some detail. 

Assume each molecule to be a mechanical system with co¬ 
ordinates (ji, < 72 ’-” momenta Px,P 2 ’”-y which we write 
simply < 7 ,jT), and witiv a Hamiltonian FI{p^q). The interaction 
between the molecules will be neglected. The total number n 
and the total eiuugy U of the assembly of molecules are 
given. 

In order to apply the laws of probability it is convenient to 
reduce the continuous set of points in phase space to a dis¬ 
continuous enumerable set of volume elements. One divides 
the phase space into N small cells of volumes ojj V, 
where V is the total volume; hence 

I* (6.12) 

To each cell a value of the energy H{p, q) can be attached, say 
that corresponding to its centre; let these energies be eg,..., c^y. 
Now suppose the ])articles distributed over the cells so that there 
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are % in the first cell, in the second, etc., but of course with 
the restriction that the totals 


(6.13) 

72-2 ~ ^ (6.14-) 

are fixed. Xiouville’s theorem suggests that the probability of a 
single molecule being in a given cell is proportional to its volume. 
Making this assumption, one has to calculate the composite 
probability P for any distribution under the restric¬ 

tions (6.13) and (6.14). 

This is an elementary problem of the calculus of probability 
(see Appendix, 14) which can be solved in this way: First the 
second condition (6.14) is omitted; then the probability of a 
given distribution Wg,..., is 


jP( 72-25 72-25•••5 ^jv) 


72 ! 


% 


n 


(6.15) 


'N' 


If this is summed over all 72^, 722,...,72jv satisfying (6.13), one ob¬ 
tains by the elementary polynomial theorem 


2 F(72j^, 722,..., 72^) (6.16) 

because of (6.12)—as it must be if P is a properly normalized 
probability. 

It is well known that the polynomial coefficients 721 / 723 ^ I 72 - 2 !.. -7'2^y - 
have a sharp maximum for 723 = 723 = ... = 72^^; that means, if 
all cells have equal volumes (<03 = a >2 = ••• == oj^^) the uniform 
distribution would have an overwhelming probability. Yet this 
is modified by the second condition (6.14) which we have now 
to take into account. The simplest method of doing this proceeds 
in three approximations which seem to be crude, but are perfectly 
satisfactory for very large numbers of particles (72 -> 00 ). The 
first approximation consists in neglecting all distributions of 
comparatively small 723 , 722 ,..., 72jy-; then the can be treated as 
continuous variables. The second approximation consists in 
replacing the exact expression (6.15) by its asymptotic value for 
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large rij^ by using Stirling’s formula log(M-!)-> n(log ^—1) (see 
Appendix, 14), and the result is 

logP = ——-^2 10 ^^ 2 ——“Wjv log const. 

(6.17) 

The third approximation consists in the following assumption: 
the actual behaviour of a gas in statistical equilibrium is deter¬ 
mined solely by the state of maximum probability; all other 
states have so little chance to appear that they can be neglected. 

Hence one has to determine the maximum of log P given by 
(6.17) under the conditions (6.13) and (6.14). Using elementary 
calculus this leads at once to 

Uj, = (6.18) 

where a and ^ are two constants which are necessary in order to 
satisfy the conditions (6.13), (6.14). Yet these constants play a 
rather different part. 

If one has to do with a mixture of two gases A and B with given 
numbers and one gets two conditions of the type (6.13) 
but only one of the type (6.14), expressing that the total energy 
is given. 

Hence one obtains 

(6.19) 

with two different constants cx^^'> and but only one There¬ 
fore f3 is the parameter of thermal equilibrium between the two 
constituents and must depend only on temperature. 

Indeed, if one now calculates the mean energy U and the 
mean pressure p, one can apply thermodynamics and sees 
easily that the second law is satisfied if 

^ = ( 6 . 20 ) 

tv AL 

where T is the absolute temperature and h a constant, called 
Boltzmann’s constant. At the same time it appears that the 
entropy is given by 

™ /clogP == —2'^alc>g^o£* (6.21) 

OC 

All these results are mainly due to Boltzmann; in partic ular (6.18) 
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is called Boltzmann’s distribution law. It obviously contains 
Maxwell’s law (6.11) as a special case, namely for mass points. 

We have now to ask: Is this consideration which I called the 
equilibrium proof of the distribution law really satisfactory ? 

One objection can be easily dismissed, namely that the 
approximations made are too crude. They can be completely 
avoided. Darwin and Bowler have shown that one can give a 
rigorous expression for the mean value of any physical quantity 
in terms of complex integrals, containing the so-called ‘partition 
function’ (see Appendix 15) 

CO CO = -^(^)* (6.22) 

No distribution is neglected and no use is made of the Stirling 
formula. Yet in the limit n ^ oo, all results are exactly the same 
as given by the Boltzmann distribution function. Although this 
method is extremely elegant and powerful, it does not introduce 
any essential new feature in regard to the fundamental question 
of statistical mechanics. 

Another objection is going deeper: can the molecules of a gas 
really be treated as independent ? 

There are numerous phenomena which show they are not, 
even if one considers only statistical equilibrium. Bor no real 
gas is ‘ideal’, i.e. satisfies Boyle’s law rigorously, and the devia¬ 
tions increase with pressure, ending in a complete collapse, con¬ 
densation. This proves the existence of long-range attractive 
forces between the molecules. The statistical method described 
above is unable to deal with them. The first attempt to correct 
this was the celebrated theory of van der Waals, which was 
followed by many others. I shall later describe in a few words 
the modem version of these theories, which is, from a certain 
standpoint, rigorous and satisfactory. 

hlore serious are the interactions revealed by non-equilibrium 
phenomena: viscosity, conduction of heat, diffusion. They can 
aU be qualitatively understood by supposing that each molecule 
has a fimte volume, or more correctly that two molecules have a 
short-range repulsive interaction which prohibits a close 
approach. This assumption has the consequence that there 
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exists an efifective cross-section for a collision, hence a moan freu^ 
path for the straight motion of a molecule. The coefficiosniis of 
the three phenomena mentioned can be reduced, by ekuiionl ary 
considerations, to the free path, and the results, as far as they go, 
are in good agreement with observations. 

All this is very good physics producing in a siinplo and intuitive! 
way formulae which give the correct order of niagnit iulc* of 
different correlated effects. 

But for the problem of a rigorous kinetic theory, which ta.k<‘H 
account of the interactions and is valid not only for etjuilibriu., 
but also for motion, these considerations have only the. va inc! f>f 
a prehminary reconnoitring. The question is: flow (;an oru^ 
derive the hydrodynamical equations of visible motion togcd.lua* 
with the phenomena of transformation and conduction of lu^at 
and, for a mixture, of diffusion ? 

This is an ambitious programme. For such a t}i<H)ry must 
include the result that a gas left to itselt tends to C{{ui]i{u’ium. 
Hence it must lead to irreversibility, although tlic laws of* 
ordinary reversible mechanics are assumed to hold for mohv. 
cules. How is this possible ? Further, is the equilibrium oht,aim'd 
in this way the same as that derived directly, say by th('! int'fihod 
of the most probable distribution ? 

To begin with the last question. Its answer represents what* 1 
have called above the dynamical proof of the distrihuthm law 
for equilibrium. 


The formulation of the non-equilibrium theory olgaHi's is du<‘ 
to Boltzmann. One can obtain his fumlainental (‘((nation by 
generalizing one of the equivalent formulae (0.^) or {tb7). Tlu'se 
are based on the assumption that each molee.uk^ movess indi'peii- 
dently of the others according to the laws of nuudmuics, au( 1 Huy 
describe how the distribution/of an assembly of suelj jmrt ieles 
develops in time. Now the assumption of ind<‘peu(kne(. is 
dropped, hence the expression on the kdl-luind sid(^ of (0 4) (,r 

(6.7) is not zero; denoting by/(I) the probability density for a 
certain particle 1, one can write 


&!} _ 

dt ~dt 




( 6 . 23 ) 
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where <7(1) represents the influence of the other molecules on the 
particle 1; it is called the 'collision integraF, as Boltzmann cal¬ 
culates it only for the case where the orbit of the centre of a 
particle can be described as straight and uniform motion 
interrupted by sudden collisions. For this purpose a new and 
independent application of the laws of probability is made by 
assumingthattheprobabilityof a collision between two particles 1 
and 2 is proportional to the product of the probabilities of finding 
them in a given configuration, /(I )/(2). If one then expresses that 
some molecules are thrown by a collision out of a given element 
of phase space, others into it, one obtains (see Appendix, 16) 

<7(1) = JJ {f'(l)/'(2)-/(l)/(2)}|5,-?,| d\yd%^ (6.24) 

where/(2) is the same function as/(I), but taken for the particle 
2 as argument. /(I), /(2) refer to the motion of two particles 
.'before’ the collision,/'(I), /'(2) to that 'after’ the collision; one 
has to integrate over all velocities of the particle 2, (d^g)? ^nd over 
the 'cross-section’ of the collision, (db), which I shall not define 
in detail. ‘Before’ and 'after’ the collision mean the asymptotic 
straight and uniform motions of approach and separation; it is 
clear that if the former is given, the latter is completely deter¬ 
mined for any law of interaction force—it is the two-body problem 
of mechanics. Hence the velocities of both particles After 

the collision are known functions of those before the coUision 

?29 Sr^id (6.23) assumes the form of an integro-difierential 
equation for calculating /. 

This equation has been the object of thorough mathematical 
investigations, first by Boltzmann and Maxwell, and later by 
modem writers. Hilbert has indicated a systematic method of 
solution in which each step of approximation leads to an integral 
equation of the normal (so-called Fredholm) type. Enskog and 
Chapman have developed this method, with some modifications, 
m detafl. There is an admirable book by Chapman and Cowling 
which represents the whole theory of non-homogeneous gases as 
a consequence of the equation (6.23). I can only mention a few 
points of these important investigations. 

The first is concerned with the question of equilibrium. Does 
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the equation (6.23) really indicate an irreversible approach from 
any initial state to a homogeneous equilibrium ? This is in fact 
the case, and a very strange result indeed: the metamorphosis 
of reversible mechanics into irreversible thermodynamics with 
the help of probability. But before discussing this difficult 
question, I shall indicate the mathematical proof. 

Brom the statistics of equilibrium it is known how the entropy 
is connected with probability, namely by equation (6.21). 
Replace here the discontinuous by the continuous / and 
summation by integration over the phase space, and you obtain 

S = -k jfil)logf(l) dgdp. , (6.25) 

If one now calculates the time derivative dS jdt by substituting 
8f{\)ldt from (6,23), and assuming no external interference, one 
finds (see Appendix, 17) 

^ > 0, (6.26) 

where the = sign holds only if/(l) is independent of the space 
coordinates and satisfies, as a function of the velocities, 

/(l)/(2) -/'(l)/'(2) (6.27) 

identically for any collision. 

The result expressed by (6.26) is often quoted as Boltzmann’s 
jH'-theorem (because he used the symbol H for — Sflc). Boltz¬ 
mann claimed that it gave the statistical explanation of 
thermodynamical irreversibility. 

Equation (6.27) is a functional equation which determines / 
as a function of ‘'collision invariants’, like total energy and total 
momentum . If the gas is at rest as a whole, the only solution of 
(6.27) is Maxwell’s (or Boltzmann’s) distribution law: 

H{p,q) = €. (6.28) 

This is what I called the dynamical proof, and is a most remarkable 
result indeed; for it has been derived from the mechanism of 
collisions, which was completely neglected in the previous 
equihbrium methods. This point needs elucidation. 

Before doing so, let mo mention that the hydro-thermal 
equations of a gas, i.e. the equations of continuity, of motion and 
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of conduction of heat, are obtained from Boltzmann’s equation 
(6.23) by a simple formal process (multiplication with 1,^ and 
followed by integration over aU velocities) in terms of the 
stress tensor T—^you will remember Cauchy’s general formula 
(4.9)—^which itself is expressed in terms of the distribution 
function/. To give these equations a real meaning, one has to 
expand/in terms of physical quantities, and this is the object of 
the theories contained in Chapman and Cowling’s book. In this 
way avery satisfactory theory of hydro-thermodynamics of gases, 
including viscosity, conduction of heat, and diffusion, is obtained. 

STATISTICAL MECHANICS 

I remember that forty years ago when I began to read scientific 
literature there was a violent discussion raging about statistical 
methods in physics, especially the i^-theorem. The objections 
raised have been classified into two types, one concerning reversi¬ 
bility, the other periodicity. 

Lioschmidt, like Boltzmann, a member of the Austrian school, 
formulated the reversibility objection in this way: by reversing 
all velocities you get from any solution of the mechanical 
equations another one—how can the integral JS, which depends 
on the instantaneous situation, increase in both cases ? 

The periodicity objection is based on a theorem of the great 
French mathematician Henri Poincare, which states that every 
mechanical system is, if not exactly periodic, at least quasi- 
periodic. This follows from Liouville’s theorem according to 
which a given region in phase space moves without change of 
volume and describes therefore a tube-shaped region of ever 
increasing length. As the total volume available is finite (it is 
contained in the surface of maximum energy), this tube must 
somewhere intersect itself, which means that final and initial 
states come eventually near together. 

Zermelo, a German mathematician, who worked on abstract 
problems like the theory of Cantor’s sets and transfinite num¬ 
bers, ventured into physics by translating Gibbs’s work on 
statistical mechamcs into German. But he was offended by the 
ogical imperfections of this theory and attacked it violently. 

G use in particular Poincare s tlieorem to show how scancia- 
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lous the reasoning of the physicists was: they claimed to have 
proved the irreversible increase of a mechanical quantity for 
a system which returns after a finite time to its initial state with 
any desired accuracy. 

These objections were not quite futile, as they led two dis¬ 
tinguished physicists, Paul Ehrenfest and his wife Tatjana, to 
investigate and clear up the matter beyond doubt in their well- 
known article in vol. iv of the Mathematical Encyclopedia. 

To-day w e hai'dly need to follow all the logical finesses of this 
work. It suffices to point out that the objections are based on the 
following misunderstanding. If we describe the behaviour of the 
gas (we speak only of this simple case, as for no other case has 
the -theorem been proved until recently) by the equation (6.4), 
taking for H the Hamiltonian of the whole system, a function of 
the coordinates and momenta of all particles, then / is indeed 
reversible and quasi-periodic, no i?-theorem can be proved. 

Boltzmann’s proof is based not on this equation, but on (6.23), 
where now H is the Hamiltonian of one single molecule un¬ 
disturbed by the others, and where the right-hand term is not 
zero but equal to the collision integral <7(1). The latter is taken as 
representing roughly the effect of all the other molecules; 
‘roughly’, that means after some reasonable averaging. This 
averaging is the expression of our ignorance of the actual micro¬ 
scopic situation. Boltzmann’s theorem says that this equation 
mixing mechaincal knowledge with ignorance of detail leads to 
irreversibility. I'here is no contradiction between the two state¬ 
ments. 

But there rises the other question whether such a modification 
of the fundamental equation is justified. We shall see presently 
that it is indeed, in a much wider sense than that claimed by 
Boltzmaim, nainely not only for a gas, but for any substance 
which can be described by a mechanical model. We have there¬ 
fore now to take up the (question of how statistical methods can 
be ai)plied to gc^neral mechanical systems. Without such a 
theory, one cannot even treat the deviations from the so-called 
ideal behaviour of gases (Boyle’s law), which appear at high 
pressure and low teinpei-ature, and which lead to condensation. 
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Theories like that of van der Waals have obviously only 
liminary character. What is needed is a general and 
founded formalism covering the gaseous, liquid, and solid stat^^ ’ 
under all kind of external forces. 

Tor the case of statistical equilibrium, this formalism 
supplied by Willard Gibbs’s celebrated book on 
Mechanics (1901), which has proved to be extremely successT*^^ 
in its applications (see Appendix, 18). The gist of Gibbs’s id-^^ 
is to apply Boltzmann’s results for a real assembly of 
equal molecules to an imaginary or ‘virtual’ assembly of max^lV' 
copies of the system under consideration, and to postulate tht^*^ 
the one system under observation will behave like the avera^^ 
calculated for the assembly. Before criticizing this assumptioti? 
let us have a glimpse of Gibbs’s procedure. He starts from 
Liouville’s theorem (6.4) and considers especially the case oT 
equilibrium where the partition function/of his virtual assembl3^ 
has to satisfy equation (6.9). He states that / = ^{H) is a sol'ti."" 
tion (as we have seen) and he chooses two particular forms of 
the function <I>. The first is 


/== = const., if £J <: H c. jEJ-[~AJE, 

= 0 outside of this interval. 


( 6 . 20 ) 


where E is sl given energy and AE a small interval of energy- - 
(In modern notation one could write ^{H) = 8{H — E), whexro 
B is Dirac’s symbolic function.) The corresponding distributioxx 
he calls micro-canonical. 

The second form is just that of Maxwell-Boltzmann, 

f ^ H{p,q) = E, (6.30) 


andthe corresponding distributionis called canonical. Gibbs shows 
that both assumptions lead to the same results for the averagois 
of physical quantities. But the canonical is preferable, as it is 
simpler to handle, turns out again to be equal for systems irx 
thermal equilibrium; if one puts p ~ l/kT the formal relations 
between the averages constructed with (6.30) are a true replicsL 
of thermodynamics. Tor instance, the normalization condition 
for the probability 

J / dpdq = j* dpdq = 1 


(6.3X) 
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can be written 

= a /;3 = hT\o%Z, ^ = jj e-^l^^dpdq. ( 6 . 32 ) 

This jP plays the part of Helmholtz’s free energy. The integral 
Z, to-day usually called the ‘partition function’, depends, apart 
from the energy E, on molar parameters like the volume V. 
All physical properties can be obtained by differentiation, e.g. 
entropy 8 and pressure p by 


8 = 


dF 

~dT' 



(6.33) 


This formalism has been amazingly successful in treating thermo¬ 
mechanical and also thermo-chemical properties. For instance, 
the theory of real (non-ideal) gases is obtained by writing 

E = H{p,q) = K{p)-\- U{q), (6.34) 

where K is the kinetic and U the potential energy; the latter 
depends on the mutual interactions of the molecules. As K is 
quadratic in the p, the corresponding integration in Z is easily 
performed and the whole problem reduces to calculating the 
multiple integral 

Q ~ j* ... J dq-^dq2‘..dqj^. (6.35) 

Still, this is a very formidable task, and much work has been 
spent on it. I shall mention only the investigations initiated by 
Ursell, and perfected by Mayer and others, the aim of which was 
to replace van der Waals’s semi-empirical equation of state by 
an exact one. In fact one can expand Q into a series of powers of 
F“^ and, introducing this into (6.32) and (6.33), one obtains the 
pressure p as a similar series 


ETi A B 

pi p2 


(6.36) 


where the coefficients called virial coefficients, are 

functions of T. One can even go further and discuss the process 
of condensation, but the mathematical difficulties in the treat¬ 
ment of the liquid state itself are prohibitive. 

The range of Jipplication of Gibbs’s theory is enornious. Hut 
reading his book again, I felt the lack of a deeper foundation. 
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A few years later (1902, 1903) there appeared a series of papers 
by Einstein ia which the same formahsm was developed, 
obviously quite independently, as Maxwell and Eoltzmann are 
quoted, but not Gibbs; these papers contained two essential 
improvements: an attempt to justify the statistical assumptions, 
and an appHcation to a case which at once transformed the kinetic 
theory ofmatter from a useful hypothesis into something very real 
and directly observable, namely, the theory of Brownian motion. 

Concerning the foundation, Einstein used an argument which 
Boltzmann had already introduced to support his distribution 
law (6.18)—^though this seems to be hardly necessary, as for a 
real assembly the method of enumerating distributions over 
cells is perfectly satisfactory. Curiously enough, this argument 
of Boltzmann is based on a theorem similar to Poincare’s con¬ 
siderations on quasi-periodicity with which Zermelo intended 
to smash statistical mechanics altogether. Einstein considers a 
distribution of the micro-canonical type, in Gibbs’s nomen¬ 
clature, where only one ‘energy surface’ H{p,q) = jE 7 in the phase 
space is taken into account. The representative point in phase 
space moves always on this surface. It may happen that the 
whole surface is covered in such a way that the orbit passes 
through every point of the surface. Such systems are called 
ergodie; but it is rather doubtful whether they exist at all. 
Systems are called quasi-ergodic where the orbit comes near to 
every point of the energy surface; that this happens can be seen 
by an argument s imil ar to that which leads to Poincard’s theorem 
of quasi-periodicity. Then it can be made plausible that the total 
tune of sojourn of the moving point in a given part of the energy 
surface is proportional to its area; hence the time average of any 
function ofp, q is obviously the same as that taken with the help 
of a micro-canonical virtual assembly. In this way quasi¬ 
periodicity is used to justify statistical mechanics, exactly 
reversing Zermelo s reasoning. This paradox is resolved by the 
remark that Zermelo believes the period to be large and macro - 
scopic, while Einstein assumes it to be unobservably small. 

Who is right ? You may find the obvious answer for yourselves 
(see Appendix, 19). 
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Modern writers use other ways of establishing the foundations 
of statistical mechanics. They are mostly adaptations of the 
cell-method to the virtual assembly; one has then to explain why 
the average properties of a single real observed system can be 
obtained by averaging over a great many systems of the virtual 
assembly. Some say simply: As we do not know the real state, 
we have the right to expect the average pi*ovided exceptional 
situations are theoretically extremely rare—and this is of course 
the case. Others say we have not to do with a single isolated 
system, but with a system in thermal contact with its siirround- 
ings, as if it were in a thermostat or heat bath; wo can then 
assume this heat bath to consist of a great many copies of the 
system considered, so that the virtual assembly is transformed 
into a real one. I think considerations of this kind are not very 
satisfactory. 

There remains the fact that statistical mechanics has justified 
itself by explaining a great many actual phenomena. Among 
these are the fluctuations and the Brownian motion to which, 
Einstein applied his theory (see Appendix, 20). To appreciate 
the importance of this step one has to remomlier that/ at tliat 
time (about 1900) atoms and molecules were still far from being 
as real as they are to-day—there were still physicists who did 
not believe in them. After Einstein’s work tins was liardly 
possible any longer. Visible tiny particles suspended in a, ga-s or a 
liquid (colloid solution) are test bodies small enough to rcwea-l 
the granular structure of the surrounding nu'diurn by tluur 
irregular motion. Einstein showed tluit; tlie statistical properties 
of this movement (mean density, mean sepjare displacement in 
time, etc.) agree qualitatively with the predictions of kinetic 
theory. Perrin later confirmed these resuU.s by exaedv im^asurt^- 
ments and obtained the first reliable value: of Avogadro's num ¬ 
ber N, the number of particles per moh^. Prom now on kinetic 
theory and statistical mechanics were definittdy (‘staJilisluid. 

But beyond this pliysical result, Einstein’s theory of Brownian 
motion had a most important consequences for seituitilic! nustlio- 
dology in general. Ildio accuracy of measurement, dcpcsrids on the 
sensitivity of the instrimu^nts, and this a,gain on th(^ siz<^ and 
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weight of the mohile parts and the restoring forces acting on 
them. Before Einstein^ s work it was tacitly assumed that 
progress in this direction was limited only by experimental 
technique. I^ow it became obvious that this was not so. If an 
indicator, like the needle of a galvanometer, became too small 
or the suspending fibre too thin, it would never be at rest but 
perform a kind of Brownian movement. This has in fact been 
observed. Similar phenomena play a large part in modern 
electronic technique, where the limit of observation is given by 
irregular variations which can be heard as a ‘noise’ in a. loud 
speaker. There is a limit of observability given by the laws of 
nature themselves. 

This is a striking example that the code of rules for inference 
by induction, though perhaps metaphysical in some way, is 
certainly not a 'priori^ but subject to reactions from the know¬ 
ledge which it has helped to create. Tor those rules which taught 
the experimentalist how to obtain and improve the accuracy o± 
his findings contained to begin with certainly no hint that there 
is a natural end to the process. 

However, the idea of unlimited improvement of accuracy need 
not be given up yet. One had only to add the rule: make your 
measurements at as low a temperature as possible. For Hrownian 
motion dies down with decreasing temperature. 

Yet later developments in physics proved this rule also to be 
ineffective, and a much more trenchant change in the code had 
to be made. 

But before dealing wdth this question we have to finish our 
review of statistical methods in classical mechanics. 

GENERAL KINETIC THEORY 

Kinetic theory could only be regarded as complete if it 
applied to matter in (visible) motion as well as to equilibrium. 
But if you look through the literature you will find very little— 
a few simple cases. The most important of these, the theory of 
gases, has been dealt with in some detail. Two others must be 
mentioned: the theory of solids and of the Brownian motion. 

Ideal solids are crystal lattices or gigantic periodic molecules. 
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But only for zero temperature are the atoms in regularly spaced 
equilibrium positions; for higher temperatures they begin to 
vibrate. As long as the amplitudes are small, the mutual forces 
will be linear functions of them; then the vibrations can be 
analysed into "normal modes’, each of which is a wave running 
through the lattice with a definite frequency. These normal 
modes represent a system of independent harmonic oscillations 
to which Gibbs’s method of statistical mechanics can be applied 
without any difficulty. If, however, the temperature rises, the 
amplitudes of the vibrations increase and higher terms appear in 
the interaction: the waves are scattering one another and are 
therefore strongly damped. Hence there exists a kind of free 
path for the transport of energy which can be used for explaining 
conduction of heat in crystals (Debye). Similar considerations, 
applied to the electrons in metallic crystals, are used for the 
explanation of transport phenomena like electric and thermal 
conduction in metals. 

In the case of Brownian motion, I have already mentioned that 
Einstein calculated not only the mean density of a colloidal 
solution, say, under gravity, but also the mean square displace¬ 
ment of a single suspended particle in time (or, what amounts to 
the same, the dispersion of a colloid by diffusion as a function 
of time). The simplifying assumption which makes this possible 
is that the mass of the colloidal particle is large compared with the 
mass of the surrounding molecules, so that these impart only 
small impulses. Similar considerations have been applied to 
other fluctuation phenomena (see Appendix, 20). 

A great number of more or less isolated examples of non¬ 
equilibria have been treated by a semi-empirical method which 
uses the notion of relaxation time. You find a very complete 
account of such things related to solids and liquids in a book of 
J. Frenkel, Ki7ietic Theory of Liquids. But you must not expect 
to find in this work a systematic theory, based on a general idea, 
nor will you find it in any other book. 

My collaborator Dr. Green and I have tried to fill this gap, 
and to develop the kinetic theory of matter in general. I hope 
you will not mind if I indulge a little in the pleasure of explaining 

6131 
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tte leading ideas. It -will help the understanding of the interplay 

of cause and chance in the laws of nature. , .. i 

We have to remember the general princip es ai o 
Gibbs which he, however, used only for the case of statistical 

^An arbitrary piece of matter, fluid or solid, is, from the atomis¬ 
tic standpoint, a mechanical system of particles (atoms, mole¬ 
cules) defined by a Hamiltonian H. Its state is comp e e y 
determined if the initial values of coordinates and momenta are 
given. Actually this is not the case; but there is a probabibty 
(as yet unknown) dpdq for the initial distribution- o 

causal laws of motion demand that the distribution ^ 

a later time t is a solution of the Liouville equation (6.4) (p. 49) 


^ = [H,/] 0 , 

dt dt ^ 


namely that solution which for t == 0 becomes /®, 


(6.38) 


Let us assume for simplicity that all molecules are eqvia 
particles (point masses) with coordinates and velocities 

^(&) _ shall consider f to be a function of these and 

write/(i^, x,^). If we want to indicate thatafunction/depends on h 
particles, we do not write all arguments, but simply //i.( 1 > 2,.. ./fc) or 
shortly/^. As aH the particles are physically indistinguishable, we 
can assume aU the functions/^ to be symmetrical in the particles. 

Now the physicist is not directly interested in a symmetric 
solutionof (6.37). He wants to know such things as the num¬ 
ber density (number of particles per unit volume) 9 ^l(i^, x) at a 
given point x of space, or perhaps, in addition, the velocity dis¬ 
tribution/i(^, X, ^), i.e. just those quantities which are familiar 
from the kinetic theory of gases, depending on one particle only. 

One has therefore to reduce the function/^ for N particles step 
by step to the function of one particle. 

This is done by integrating over the position and velocity of 
one particle, say the last one, with the help of the integral 
operator ». 

= I I (6.39) 
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If/^+i is given, we obtain/^ by applying the operator it is, 
however, convenient to add a normalizing factor and to write 

(■^ (6.40) 

The physical meaning of the operation is this: we give up the 
pretence to know the whereabouts of one particle and declare 
frankly our ignorance. By repeated application of the operation, 
we obtain a chain of functions 

/iv» /v-i» •••> As /i> (6.41) 

to which one can add = 1; /^ means the probability of finding 
the system in a state where q particles have fixed positions (i.e. 
lie in given elements). The normalization is such that 

J fS, x®, 5®) d?® == X®) (6.42) 

is the number density; for one has 

J x<i>) = 

where the last equahty follows from (6.40) for g = 0 (with 
/o= !)• 

Now we have to reduce the fundamental equation (6.37) step by 
step by repeatedly applying the operator x Appendix, 21). 
Assuming that the atoms are acting on another with central 
forces, being the potential energy between two of them, the 
result of the reduction is a chain of equations of the form 

(<?=!, 2 ,..., N), (6.44) 

where A+i]- (6.45) 

This quantity /S'^ will be called the statistical term. What is the 
advantage of this splitting up of the problem into the solution of 
a chain of equations? The first impression is that there is no 
advantage at all; for to determineyou need to knowbut 
contains/ 2 , and this again depends on/^, and so on, so that one 
eventually arrives at /v, which satisfies the original equation. 
Yet this reasoning supposes the desire to get information about 
every detail of the motion, and that is just what we do not want. 


f f A dx®<Z?® = xi A == N, (6.43) 
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We wish, to obtain some observable and rather crude averages. 
Starting from/i and cUmbing up to/ 2 ,/ 3 ,-.., soon stop, as 

the chaos increases with the number of particles, and replace the 
rigorous connexion between/^ and/g^.! by an approximate one, 
according to the imperfection of observation. 

Before explaining the application of this “^method of ignorance^ 
to simple examples,.! wish to mention that we have actually 
foimd the chain of equations (6.44) in quite a different way, 
starting with/^ and using the calculus of probability for events 
not independent of one another (see Appendix, 22). 

This derivation is less formal than the first one and illuminates 
the physical meaning of the statistical term. 

It would now be very attractive to show how from this genex'al 
formula (6.44) the mechanical and thermal laws for continuous 
substances can be derived. But I have to restrict myself to a 
few indications concerning the general ‘method of ignorance’, 
to which I have already alluded. 

The first example is the theory of gases. We have seen that 
this theory is based on Boltzmann’s equation (6.23) 

^ = [^r,/(l)]+C(l), (6.46) 

where (7(1) is the collision integral (6.24); 

C'(l) = J [/'(l)/'(2)-/(l)/(2)]|5i-l2| d\>dTg^. (6.47) 

Now (6.46) has the same form as our general formula (6.44) for 
S' = Ij provided (7(1) can be identified with 

Green has shown that this is indeed the case, provided that tiie 
molecular forces have a small range ^ 0 ; then one can assume that 
in the gaseous state the probability of finding more than two 
particles m a distance smaller than Tq is negligible. In other 
wor<fe, one can exclude all except ‘binary’ encounters. Xwo 
particles outside the sphere of interaction can be regarded, as 
iiid. 6 p 6 iid.ent}; lienee one lias tliere 

A(l,2) =/^(l)/j(2). (6.48) 

yis holds ^o, in virtue of LiouviUe’s theorem, if on the left- 
hand side the positions and velocities refer to a point in th 
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interior of the sphere of action while on the right-hand side the 
values on its surface are used. With the help of this fact, the 
integration in can be performed (see Appendix, 23), and 
leads exactly to the expression (7(1), in which only the ‘boundary 
values’ of the functions/(l) and/( 2 ) on the surface of the sphere 
of action appear. 

Hence the whole kinetic theory of gases is contained as a 
special case in our theory. 

Concerning liquids, one must proceed in a different way, 
because triple and higher collisions cannot be handled with 
elementary formulae. We have adopted a method suggested by 
the American physicist Kirkwood. His formula is a generali¬ 
zation of (6.48), namely 


fdoo. _/2(2,3)/2(3, !)/,(!, 2) 


(6.49) 


and may be interpreted in different ways, e.g. by saying tliat the 
occurrences of three pairs of particles (2, 3), ( 3 , 1 ), (1,2) at given 
positions and with given velocities are almost in<l(>f)endent 
events, because the mutual interactions decrease rapidly with 
the distance. 

Substituting from (6.49) in aS^ 2 ^ obtains from (6.4/1), 

(6.46) two integro-diffcrential equations for/j andwhi(;h form 
a closed system and can be solved by suitable approxinuitions. 
(If then/g is calculated from the solution witli tlie lu4p of 

(6.49), the relation (6.40) for q 3 is not necessarily satislicul; 
this is the sacrifice of accuracy introduced by tlu^ Kirkwoo<l 
method.) 

All physical proy)erties of a liquid of tho kind discaisstMl lu^r<^ 
(particles with central forces) can be expressed in i(‘rms of 
ri 2 (l, 2), afunction known to the experimenters in X -ray nvscMindi 
on liquids as the radial distribution function, ’'riu^ nu'thod 
explained leads to explicit formulae for tlu^ e<pia,tion of state 
and the energy; it allows also a. discussion of t-lu’i Hingularity 
which separates the gaseous and licp.iid stat/'s. Hul. I (cannot 
enter into a discussion of details. 

Concerning non-equilibria, oiu^ can obtain tbc‘- dilhu'entiul 
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equations for the mechanical and thermal flow in a rigorous way; 
the result has, of course, the form of Cauchy’s equations (4.9) 
for continuous media, yet with a stress tensor which can be 
explicitly expressed in terms of the time derivatives of the strain 
tensor (or the space derivative of the velocity) and the gradient 
of temperature. In this way expressions for the coefficients of 
viscosity and thermal conductivity are obtained. They differ 
from the known formulae for gases by the great contribution of 
the mutual forces. Yet again I cannot dwell on this subject 
which would lead us far from the main topic of these lectures, 
to which I propose now to return (see Appendix, 33). 




VII 

CHANCE AND ANTECEDENCE 


What can we learn from all this about the general problem of 
cause and chance ? The example of gases has already shown us 
that the introduction of chance and probability into the laws of 
motion removes the reversibility inherent in them; or, in other 
words, it leads to a conception of time which has a definite 
direction and satisfies the principle of antecedence in the 
cause-effect relation. 

The formal method consists in defining a certain quantity, 


the entropy 


j f log fdpdq 
/ / dpdq 


(7.1) 


and showing that it never decreases in time : dJSIdt 0. In the 
case of a gas, the function / was the distribution function of 
one single molecule, a function of the point p,q of the phase 
space of this molecule. 

The same integral represents also the entropy of an arbitrary 
system in statistical mechanics, if / is replaced by the 
distribution function in the 2A-dimcnsional phase space; it 
satisfies all equilibrium relations of ordinary thermodynamics. 

In the case of a gas, the time derivative of S could bo deter¬ 
mined with the help of Boltzmann’s collision equation, and it 


was found that always 



(7.2) 


I have stressed the point that this is not in contradiction to 
the reversibility of mechanics; for this reversibility refers to a 
distribution function of non-interacting molecules, satisfying 

|=[H./], (7.3) 


while molecules colliding with one another satisfy 



(7.4) 
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where G is the collision integral. Irreversibility is therefore a 
consequence of the explicit introduction of ignorance into the 
fundamental laws. 

Now the same considerations hold for any system. If we take 
for / the function/jy of a closed system of N particles, (7.3) is 
again satisfied, and if its solution is introduced into (7.1), it can 
be easily shown that dSJdt = 0. 

Irreversihihty can be imderstood only by explicitly exempting 
a part of the system from causality. One has to abandon the 
condition that the system is closed, or that the positions and 
velocities of all particles are under control. The remarkable 
thing is that it suffices to assume one single particle beyond con¬ 
trol. Then we have to do with a system of iST-f-1 particles, but 
concentrate our interest only on N of them. The partition func¬ 
tion of these N particles satisfies the equation (6.44) for q — Ni 

(7.5) 

whereis a certain integral over/jy+j given by (6.45) for q ~ TV'. 
Tor a solution of this equation (7.6) the entropy is either constant 
or increasing. This is of course a fortiori the case if the system 
of N particles is coupled to more complicated systems out of 
control (see Appendix, 24). 

The increase of S continues until statistical equilibrium is 
reached, and it can he shown that the final distribution is the 
canonical one 

H{p,q) = E. (7.6) 

This result is, in my opinion, the final answer of the age-old. 
question how the reversibility of classical mechanics and the 
nreversihility of thermodynamics can be reconciled. The latter 
is due to a deliberate renunciation of the demand that in prin¬ 
ciple the fate of every single particle can be determined. You 
must viola-te mechanics in order to obtain a result in obvious 
contradiction to it. But one may say: this violation may be 
necessaiy from practical reasons because one can neither observe 
all particles nor solve the innumerable equations—in reality, 

owever, the world is reversible, and thermodynamics only a 
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trick for obtaming probable, not certain, results. This is the 
standpoint taken in many presentations of statistical mechanics. 
It is diiciilt to contradict if one accepts the axiom that the 
positions and velocities of all particles can, at least in principle, 
be determined—but can this really be maintained? We have 
seen that the Brownian motion sets a limit to all observations 
even on a macroscopic scale. One needs a spirit who can do 
things we could not even do with infinitely improved technical 
means. Further, the idea of a completely closed system is also 
almost fantastic. 

I think that the statistical foundation of thermodynamics is 
quite satisfactory even on the ground of classical mechanics. 

But in fact, classical mechanics has turned out to be defective 
just in the atomic domain where we have applied it. The whole 
situation has therefore to be re-examined in the light of quantum 
mechanics. 




VIII 

MATTER 

MASS, ENERGY, AND RADIATION 

In order not to lose sight of my main subject I have added to the 
heading of each section of these lectures words like 'cause’, 
‘contiguity’, ‘antecedence’, ‘chance’. The one for the present 
section, ‘matter’, seems to be an intruder. For classical philo¬ 
sophy teaches that matter is a fundamental conception of a 
specific kind, entirely different from cause, though on the same 
level in the hierarchy of notions: another ‘category’ in Kant’s 
terminology. This doctrine was generally accepted at the time 
before the great discoveries were made of which I have now to 
speak. It was the period when physics was governed by the 
dualism ‘force and matter’, Kraft und JStqff (the title of a popular 
book by Buchner). In modern physics this duality has become 
vague, almost obsolete. The first steps in this direction have been 
described in the preceding survey: the transition from Kewton’s 
distance forces to contact forces, first in mechanics, then in 
electromagnetism, and finally for gravitation; in other words, 
the victory of the idea of contiguity. If force is spreading in 
empty space’ with fimte velocity, space cannot be quite empty; 
there must be something which carries the forces. So space is 
ffled with ether, a kind of substance akin to ordinary matter 
in many respects, in which strains and stresses can be produced. 
Though these contact forces obey different laws from those which 
govern elasticity, they are still forces in an ether, something 
different from the carrier. Yet this distinction vanishes more 
and more. Relativity showed that the ether does not share with 
ordinary matter the property of ‘localization’: you cannot say 
here I am’; there is no physical way of identifying a point in the 
ether, as you could recognize a point in running water by a 
ttle mark, a particle of dust. Electric and magnetic stresses 
are not something in the ether, they are ‘the ether’. The question 
of a carrier becomes meaningless. 

However, this is a question of interpretation. Physicists are 



75 


MASS, ENERGY, AND RADIATION 

very broad-minded in this respect; they will continue using obso¬ 
lete expressions like ether, and no harm is done. Por them a 
matter of terminology is not serious until a new quantitative 
law is involved. That has happened here indeed. I refer to the 
law connecting mass m, energy e, and the velocity of light c 
(see Appendix, 25), ^ _ o /o ix 


which, after having been found to hold in special cases, was 
generally established by Einstein. His reasoning is based on the 
existence of the pressure of light, demonstrated experimentally 
and also derivable from Maxwell’s equations of electrodynamics. 
If a body of mass M emits a well-defined quantity of light in a 
parallel beam which carries the total electromagnetic energy €, 
it suffers a recoil corresponding to the momentum c/c transferred. 
It therefore moves in the opposite direction, and to avoid a clash 
with the mechanical law that the centre of mass of a system 
cannot be accelerated by an internal process, one has to ascribe 
to the beam of light not only an energy € and momentum e/c but 
also a mass c/c^, and to assume that the mass M of the emitting 
body is reduced by the same amount m = 

The theory of relativity renders this result quite natural. It 
provides, moreover, an expression for the dependence of mass 
on velocity; one has 




( 8 . 2 ) 


■where is called the rest-mass, 
then given by a 


Energy e and momentum p are 
p = mv. (8.3) 


I need hardly to remind you how this result of ‘purest science’ 
has been lately confirmed hy a terrifying, horrible, ‘technical’ 
ai)plication in New Mexico, dapari, and Bikini. There is no 
doubt, matter and energy arc the same. The old duality between 
the force and the substance on which it acts, has to be aban¬ 
doned, and hence also the original idea of force as the cause of 
motion. Wo see how old notions are dissolved by new ex¬ 
periences. It is this process which has led me to the abstract 


P 


t M. IB^oriXy Atomic l^hymcti (lila<-.kio), -t th od., Oh. 111. 2, p. 62; A. Vt.!, 

. 288. Soo ApiKJUtlix, 25. 
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of causality based only on the notion of physical 
dependence, hut transcending special theories which change 
according "fco tlic experimental situation. 

Returning to our immediate object, we learn from Einstein’s 
law that the atomistic conception of matter is necessarily con¬ 
nected with the atomistic conception of energy. In fact the 
existence of quanta of energy was deduced by Planck from the 
laws of heat radiation five years before Einstein priblishod his 

relation between mass and energy. 

Planck’s discovery opened the first chapter in the history of 
quantum theory, which corresponds to the years 1900 to 1913 
and could be entitled ‘Tracing the quantum by thermodynamicial 
and statistical methods’. The next chapter deals with, the period 
1913-25 when spectroscopical and electronic methods were in 
the foreground, while the last chapter describes tho birth and 
development of quantum mechanics. 

I cannot possibly give an account of this long and tedious 
development, but I shall pick out a few points which are not so 
well known and hardly found in text-books, beginning with sonuj 
remarks on the thermo-statistical quantum hunt. 

The problem which Planck solved was the determination of 
the density of radiation p in equilibrium with matter of a givcm 
temperature T as function of T and of the frequency v, so that 
p(v, T) dv is the energy per unit volume in the frequency interval 
dv. By purely thermodynamical methods several properties of 
this function were known: the temperature dependence of tho 
total radiation j p dv — aT^ (law of Stefan and Boltzmann) jnid 
the specification that p/v® is only a function of the quotient vj 
The problem remained to determine this function, and here 
statistical methods had to be used. 

One can proceed in two ways. Either one regards the radiation 
as being in equilibrium with a set of atoms which in tlieir intex*- 
action with radiation can be replaced by harmonic oscillators; 
then the mean energy of these can be calculated in terms of the 
radiation density and turns out to he proportional to it. '^CJiis 
was the method preferred by Planck. Or one regards the radia- 

t Law of Wien; see Atomic Physics^ Ch. VII. 1, p. 198; A. XXVJI, p. 843. 
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tion itself as a system of oscillators, each of these representing 
the amplitude of a plane wave. This method was used by Ray¬ 
leigh and later by Jeans. In both cases the relation between the 
mean energy u(v) of the oscillators of frequency v and the radia¬ 
tion density p is givep byf 

P = (8.4) 

and it suffices to determine u. 

This can be done with the help of the so-called equipartition 
law of statistical mechanics. Suppose the Hamiltonian H of a 
system has the form 

H = (8.5) 


where ^ is any coordinate or momentum and contains all 
the other coordinates and momenta but not i. Then the mean 
value of the contribution to the energy of this variable ^ is (see 
Appendix, 26) 


a 


k 


T, 


( 8 . 6 ) 


independent of the constant a —hence tlie same for all variables 
of that description. 

Applied to a set of oscillators of frecjncincy v, wlierc 


H 




m. 


(p2-h47r‘-2i/Y“)> 


(8.7) 


one obtains for tlie average energy 

vr - Ic/I\ 


hence, from (8.4), 


P 


87Tj/“ 


/y/\ 


( 8 . 8 ) 


( 8 . 1 >) 


This is called the Rayleigh-tlesa-ns ra,dial.ion Ibrimila. It is a 
rigorous consequence of chissicial stat-isl.ie.a.l mcichanit's, but 
nevertheless in obvious contradiction t.o Fads. It dotss not Civen 
lead to a finite total ra-diation, sinc^e p increases as r- with Fnv 
quency. The law is, howevavr, not (piit.e a,bHurd as it. a.gre(\s well 
with measurements for snuill fnajueneit's (long wave-s) or high 
temperatures. At the other eiul of tlu^ sjaal.runi, th(^ observtal 
t Atomic rhyNlcN, Ch. VII. 1, p. 2U! ; A. X X VM 11, p. 217. 
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energy density decreases again, and Wien has proposed for tl 
region an experimental law which would correspond to tj 
assumption that in (8.4) the oscillator energy is of the form 

u — Uq er^ttlTcT^ 

This loofa very much like a Boltzmann distribution. Accordir 
to Wien s displacement law it holds for high values of tl 



Fig. 1. 
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distribution law, according to which the probability of finding 
a system in a state with energy e is proportional to where 
^ = l/kT, one can express the mean square fluctuation of the 

(Aip = (e —€)2 = i2__-2 


in terms of the average energy e = u itself if the latter is given as 
function of temperature or of j3 (see Appendix, 20.10): 


(Ae)2 


du 

dp' 


( 8 . 11 ) 


Now this function 'u(p) is known for the two limiting cases: T 
large or p small, and T small or p large, from (8.8) and (8.10), 

(p~'^ for small B, 
for large 

Hence one has 


du _ ip~^ = small p\ 

dp u, large p. 


(8.13) 


Now Planck argues like this: the two limiting cases will corre- 
spond to the preponderance of two different causes, whatever 
they may be. A well-known theorem of statistics says that the 
mean square fluctuations due to independent causes are additive, 
het us assume that the condition of independence is here satisfied. 
Hence, if both causes act simultaneously, wc should have 


(A€)=^ 


du 

dp 




(8.U-) 


This is fl/ diiTGrcnlji^il Gcjua/tion for witli. tliG ^oMGrcil solut/ion 


The constant of integration a must vanish in order to have the 
limiting cases (8.15) all right. W^ion’s displacement law, accord¬ 
ing to which p/v'* === ^rru/a^v depends only on Tjv, leads then to 
e© = hv, where }i is a constant, known as Iflanck’s constant. 
The result is l laiick s lorniula for the mean oscillator energy 

- 1 

from which the radiation density follows according to (8.9); a 
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refined interpolation which turned out to be in so excellent agree¬ 
ment with experiment that Planck looked for a deeper explana¬ 
tion and discovered it in the assumption of energy quanta of 
finite size €q — hv. If the energy is a multiple of the integral 
(6.32) has to he replaced hy the sum 


00 

jZ = '^ ~ 

■w = 0 


1 

1 — 


(8.17) 


and then the usual procedure outlined in section 6 leads at once 
to the expression (8.16) for the oscillator energy u. 

Planck believed that the discontinuity of energy was a pro¬ 
perty of the atoms, represented by oscillators in their interaction 
with radiation, which itself behaved quite normally. Seven 
years later Einstein showed that indeed wherever oscillations 
occur in atomic systems, their energy follows Planck’s formula 
(8.16); I refer to his theory of specific heat of molecules and solids 
which opened more than one new chapter of physics. But this 
is outside the scope of these lectures.f 

Einstein had, however, arrived already in 1906 at the con- 
‘ elusion that radiation itself was not as innocent as Planck 
assumed, that the quanta were an intrinsic property of radia¬ 
tion and ought to be imagined to be a kind of particles hying 
about. In text-books this revival of Newton’s corpuscular theory 
of hght is connected with Einstein’s explanation of the photo¬ 
electric effect and similar phenomena where kinetic energy of 
electrons is produced by light or vice versa. This is quite correct, 
but not the whole story. Por it was again a statistical argument 
which led Einstein to the hypothesis of quanta of light, or 
photons, as we say to-day. 

He considered the two limiting cases (8.13) from a different 
point of view. Suppose the wave theory of light is correct, then 
heat radiation is a statistical mixture of harmonic waves of all 
directions, frequencies, and amplitudes. Then one can deter¬ 
mine the mean energy of the radiation and its fluctuation in a 
given section of a large volume. This calculation has been per¬ 
formed by the Dutch physicist, H. A. Lorentz, with the result 


t See Atomic Physics^ Ch. VTI- 2, p. 207. 
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that (Ap)2 == fQr any frequency, or expressed in terms of the 
equivalent oscillators, (Ae)‘^ = in agreement with the 
Rayleigh-Jeans case (small p, large T) in (8.13). Hence there 
must be something else going on besides the waves, for which 
(Ae)^ = €qU; what can this be? 

Suppose Planck’s quanta exist really in the radiation and let 
n be their number per unit volume and unit frequency interval. 
As each quantum has the energy gq — hv, one has i — u ~ nsQ, 
and {Ae)^ = €§(A? 2 )'^. Hence the fluctuation law in Wien’s case 
(large small T) of (8.13) can be written as 

{A^ 7h. (8.18) 


This is a well-known formula of statistics referring to the 
following situation: a great number of objects are distributed at 
random in a big volume and n is the number contained in a part. 
Then one has just the relation (8.18) between the average n 
and its mean square fluctuation (see Appendix, 20). So Einstein 
was led to the conclusion that the Wien part of the fluctuation 
of energy is accounted for by quanta behaving like index)ondent 
particles, and he corroborated it by taking into account, besides 
the energy, also the momentum hvjc of the quantum and the 
recoil of an atom produced by it. Tt was this result which en¬ 
couraged him to look for experimental evidence and led him to 
the well-known interpretation of the photo-electric effect as a 
bombardment of photons which knock out electron.s from the 
metal transferring their entu*gy to them. 

Expressed in terms of photon numbers the combined fluctua¬ 
tion law (8.14) reads 


(Aw)’*^ 


..L - 77, I a-’’ ~ 77,(77 -h 1) 

€() dp 


(8.19) 


with the general solutiem 

77 


I 

I 0_j ’ 


( 8 . 20 ) 


where a = 0 leads t(7 the correct va,luo for large T. But what if 

q: 0 ? 

Every physicist glancing at ttu'' last formula, will ix^cognize 
it as the Ho-callcd liovso I^^instKMn <liHt.riV)uti()n Ia.w for an uUkiI 
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gas of indistinguisiiabl© particles according to quantum theory- 
It is most remarkable that at this early stage of quantum theory 
Planck and Einstein have already hit on a result which was 
rediscovered much later (Einstein again participating) (see 
Appendix, 25, 32). In fact Planck’s interpolation can be inter¬ 
preted in modern terms as the first and completely successful 
attempt to bridge the gulf between the wave aspect and the 
particle aspect of a system of equal and independent components 
whatever they may be—^photons or atoms. 

I shall conclude this section by giving a short account of 
another consideration of Einstein’s which belongs to a later 
period of quantum theory, when Bohr’s theoiy of atoms was 
aheady well established, namely the existence of stationary 
states in the atoms which differ by finite amounts of energy 
content. Suppose the atom can exist in a lower state 1 and a 
higher state 2; transitions are possible by emission or absoi'p- 
tion of a light quantum of energy Cg—== cq, hence of frequency 
V = €q//i. On the other hand, according to Boltzmann’s law 
the relative number of atoms in the two states will be 

^ = e-jSe.. (8.21) 

Now one can write (8.20), with cx = 0, in the form 

(n+l)e-^^» = n, 

or, using (8.21), (8.22) 

For this equation Einstein gave the following interpretation: 
the left-hand side represents the number of quanta emitted per 
unit of tune from the A^ atoms in the higher state, the right- 
hand side those absorbed by the A^ atoms in the lower state, 

two processes which in equilibrium must of course cancel orio 
another. 

The absorption is obviously proportional to the number of 
atoms in the lower state, and to the number n of photo iib 
present, i.e. to ^tA^. Concerning the emission the term Ag signifies 
a spontaneous process, independent of the presence of radiation; 
it corresponds to the well-known emission of electromagnetic 
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waves by a vibrating system of charges. The other term is 
a new phenomenon which was signalled the first time in this paper 
of Einstein (later confirmed experimentally), namely a forced 

or induced emission proportional to the number of photons 
present. 

If we denote the number of spontaneous emissions by 
of induced emissions by of absorptions we 

learn from (8.22) that the probability coefficients (probabilities 
per unit time, per atom, and per light-quantum) are all equal; 


^ = ^12 = B^^. (8.23) 

This result had far-reaching consequences. The first is the exis¬ 
tence of a symmetric probabihty coefficient B^^, = ^21 transi¬ 
tion between two states induced by radiation. This became one 
of the clues for the discovery of the matrix form of quantum 
mechanics. 


The second point is seen if one considers, not equilibrium, but 
a process in time; Einstein’s consideration leads at once to the 
equation 


dn dN^ 
dt dt 




(8.24) 


which is of the type used by the chemists for the calculation of 
reaction velocities. One has, in their terminology, three com¬ 
peting reactions, namely two diatomic ones and one monatomic 
one. Now genuine monatomic reactions are rare in ordinarv 
chemistry, but abundant in nuclear chemistry; they were in 
fact until recently the only known ones, namely the natural 
radioactive disintegrations. If the radiation density is zero, 
n ~ 0, one has 

AN^, (8.25) 


dN. 


dt 


which is exactly the elementary law of radioactive decay, 
according to Kutherford and Soddy. It expresses the assumption 
that the disintegrations are purely accidental and completely 
independent of one another. 

Thus Einstein’s interpretation means the abandonment of 
causal description and the introduction of the laws of chance 
for the interaction of matter and radiation. 
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Although my programme takes me through the whole history 
of physics, I am well aware that it is a very one-sided account of 
what really has happened. It will not have escaped you that I 
believe progress in physics essentially due to the inductive 
method (of which I hope to say a little more later), yet the experi¬ 
mentalist may rightly complain that his efforts and achieve¬ 
ments are hardly mentioned. Yet as I am concerned with the 
development of ideas and conceptions, I may be permitted 
to take the skill and inventive genius of the experimenters for 
granted and to use their results for my purpose without detailed 
acknowledgement. 

The period about 1900, when quantum theory sprang from 
the investigations of radiation, was also full of experimental 
discoveries: radioactivity, X-rays, and the electron, are the 
major ones. 

In regard to the role of chance in physics, radioactivity was of 
special importance. As I said before, the law of decay is the 
expression of independent accidental events. Moreover, the 
decay constant turned out to be perfectly insensitive to all 
physical influences. There might be, of course, some internal 
parameters of the atom which determine when it will explode. 
Yet the situation is different from that in gas theory: there we 
know the internal parameters, or believe we know them, they 
are supposed to be ordinary coordinates and momenta; what 
we do not know are their actual values at any time, and we are 
compelled to take refuge in statistics because of this lack of 
detailed knowledge. In radioactivity, on the other hand, nobody 
had an idea what these parameters might be, their nature itself 
was unknown. However, one might have kindled the hope that 
this question would be solved and radioactive statistics reduced 
to ordinary statistical mechanics. In fact, just the opposite has 
happened. 

Hadioactivity is also important for our problem because it 
provided the means of investigating the internal structure of the 
atom. You know how Hutherford used a-particles as projectiles 
to penetrate into the interior of the atom, and found the nucleus. 
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This result, together with J. J. Thomson’s discovery of the elec¬ 
tron, led to the planetary model of the atom: a number of elec¬ 
trons surrounding the nucleus, bound to it by electric forces. The 
fundamental difficulty of this model is its mechanical instability. 

As long as nothing was known about the forces which keep the 
elementary particles in an atom together one could assume a 
law of force which allowed stable equilibrium states. An in¬ 
genious model of this kind is due to J. J. Thomson. But now one 
knew that the forces were electrostatic ones, following Coulomb’s 
law, and these could never guarantee the extraordinary stability 
of the actual atoms which survive billions of collisions without 
any change of structure. Bohr connected this difficulty with the 
facts of spectroscopy, and the result was his well-known model 
of the atom consisting of ‘quantized’ electronic orbits. 

Mentioning spectroscopy, I feel again sadly how I have to 
skip over great fields of research with a few words. 

The discovery of simple laws in line spectra was in fact a great 
achievement. Still more important than numerical formulae, 
like the one discovered by the Swiss schoolmaster Balmer for 
the hydrogen spectrum, was the rule found by Ritz (also a 
Swiss, who unfortunately died quite young), the so-called com¬ 
bination principle; it says that the frequencies of the spectral 
lines of gases can be obtained by forming differences of a single 
row of quantities which are called terms: 


(8.26) 


though not all of these differences appear as lines in the spectrum. 
Balmer’s formula for hydrogen is a special case where ^ 


namely 



(m = 3,4,...). 


The formula (8.26) gave Bohr the clue to the application of 
quantum theory. Multiplying it by Planck’s constant h he 
interpreted it as the energy difference between any 

two stationary states having the energies == hT.„^ {n = 1,2,...). 
This interpretation is a sweeping generalization of Planck’s 
original conception of discrete energy-levels of oscillators. It 
explained at once the stability of atoms against impacts with an 



86 


MATTBB 


energy smaller than a certain threshold, the difference between 
emission and absorption spectra (the latter being of the form 

where 1 means the ground state), and was in 
detail confirmed by the well-known experiments of Franck and 
Hertz (excitation of spectra by electron bombardment). 

However, I cannot continue to describe the whole develop-* 
ment of quantum theory because that would mean writing an 
encyclopaedia of physics of the last thirty-five years. I have 
given this short account of the initial period because it is fashion¬ 
able to-day to regard physics as the product of pure reason. 
Now I am not so unreasonable as to say that physics could 
proceed by experiment only, without some hard thinking, nor 
do I deny that the forming of new concepts is guided to some 
degree by general philosophical principles. But I know from my 
own experience, and I could call on Heisenberg for confirmation, 
that the laws of quantum mechanics were found by a slow and 
tedious process of interpreting experimental results. I shall try to 
describe the main steps of this process in the shortest possible way. 

Yet it must be remembered that these steps do not form a 
straight staircase upwards, but a tangle of interconnected alleys. 
However, I must begin somewhere. 

There was first the question whether the stationary states are 
certain selected mechanical orbits, and if so, which. Proceeding 
from example to example (oscillator, rotator, hydrogen atom), 
‘quantum conditions’ were found (Bohr, Wilson, Sommerfeld) 
which for every periodic coordinate q of the motion can be 
expressed in the form 

2 = <^pdq = hn, (8.27) 

where p is the momentum corresponding to q and the integration 
extended over a period. The most convincing theoretical 
argument for choosing these integrals I was given by Ehrenfest, 
who showed that if the system is subject to a slow external per¬ 
turbation, I is an invariant and therefore well suited to be 
equated to a discontinuous ‘jumping’ quantity Tin. 

Among these ‘adiabatic invariants’ I there is in particular the 
angular momentum of a rotating system and its component in a 
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given direction; if both are to be integer multiples of the strange 
conclusion is obtained that an atom could not exist in all orienta- 
tions -but only in a selected finite set. Tliis was confirmed by 
Stern and Gerlach’s celebrated experiment (deflecting an 
atomic beam in an inhomogeneous magnetic field). I am proud 
that this work was done in my department in Frankfort-on- 
Main. There is hardly any other effect which demonstrates the 
deviations from classical mechanics in so striking a manner. 

A signpost for further progress was Bohr’s correspondence 
principle. It says that, though ordinary mechanics does not 
apply to atomic processes, we must expect that it holds at least 
approximately for large quantum numbers. This was not so 
much philosophy as common sense. Yet in the hands of Bohr 
and his school it yielded a rich harvest of results, beginning with 
the calculation of the constant R in the Balmer formula, f The 
mysterious laws of spectroscopy were reduced to a few general 
rules about the energy-levels and the transitions between them. 
The most important of these rules was Pauli’s exclusion prin¬ 
ciple, derived from a careful discussion of simple spectra; it says 
that two or more electrons are never in the same quantum state, 
described by fixed values of the quantum numbers (8.27) 
belonging to all periods, including the electronic spin (Uhlenbeck 
and Goudsmit). With the help of these simple principles the 
periodic system of the elements could be explained in terms of 
electronic states. But all these great achievements of Bohr’s 
theory are outside the scope of our present interest. I have, 
however, to mention Bohr’s considerations about the correspon¬ 
dence between the amplitudes of the harmonic components of a 
mechanical orbit and the intensity of certain spectral lines. 
Consider an atom in the quantum state n with energy and 
suppose the orbit can be, for large approximately described 
by giving the coordinates q as functions of time. As these will 
be periodic, one can represent g as a harmonic (Fourier) series, 
of the type 

CO 

q{t) = 2 t3t,n(^)cos[27rv(?i)(mH-S„J], (8.28) 

m=l 

t So© Atomic Physical Ch. V. 1, p. 98; A. XIV, p. 300, 
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where the fundamental frequency v{n) and the amplitudes 
a^{n) depend on the number n of the orbit considered. In reality, 
the frequencies observed are not v{n), 2v{n), Zv{n),... but 

^nm ^ ^tn)» 

and what about the amplitudes ? It was clear that the squares 
|am(«-)I® should correspond in some -way to the transition proh- 
abilities — B^^ introduced by Einstein in his derivation 
of Planck’s radiation law (8.16). But how could the mth over¬ 
tone of the ^ 2 -th orbit be associated with the symmetric relation 
between two states 

This was the central problem of quantum physics in the years 
between 1913 and 1925. In particular there arose a great interest 
in measuring intensities of spectral Hues, with the help of newly- 
invented recording micro-photometers. Simple laws for the 
intensities of the component lines of multiplets were discovered 
(Ornstein, Moll), and presented in quadratic tables which look 
so much like matrices that it is hard to understand why this 
association of ideas did not happen in some brain. 

It did not happen because the mind of the physicist was stiU 
working on classical lines, and it needed a special effort to get 
rid of this bias. One had to give up the idea of a coordinate 
being a function of time, represented by a Fourier series like 
(8.28); one had to omit the summation in this formula and to 
take the set of unconnected terms as representative of the 
coordinate. Then it became possible to replace the Fourier 
amplitudes a^j,{n) by quantum amplitudes a{m,n) with two 
equivalent indices m, n, and to generalize the multiplication law 
for Fourier-coefficients into that for matrices*]' 

Cm(^) = 2 c(m, n) = ^ a{m, h)b{k, n). (8.29) 

k k 

Heisenberg justified the rejection of traditional concepts by a 
general methodological principle: a satisfactory theory should 
use no quantities which do not correspond to anything ob¬ 
servable. The classical frequencies mv{n) and the whole idea of 
orbits have this doubtful character. Therefore one should 
t See Atomic Physics, Cli. V. 3, p, 123; A. XV, p. 305. 
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eliminate them from the theory and introduce instead the 
quantum frequencies while the orbits should 

be completely abandoned. 

This suggestion of Heisenberg has been much admired as the 
root of the success of quantum mechanics. Attempts have been 
made to use it as a guide in overcoming the difficulties which 
have meanwhile turned up in physics (in the application of 
quantum methods to field theories and ultimate particles); yet 
with little success. IsTow quantum mechanics itself is not free 
from unobservable quantities. (The wave-function of SchrO- 
dinger, for instance, is not observable, only the square of its 
modulus.) To rid a theory of all traces of such redundant con¬ 
cepts would lead to unbearable clumsiness. I think, though 
there is much to be said for cleaning a theory in the way recom¬ 
mended by Heisenberg, the success depends entirely on scientific 
experience, intuition, and tact. 

The essence of the new quantum mechanics is the representa¬ 
tion of physical quantities by matrices, i.e. by mathematical 
entities which can be added and multiplied according to well- 
known rules just like simple numbers, with the only difference 
that the product is non-commutative. Por instance, the quan¬ 
tum conditions (8.27) can be transcribed as the commutation law 

ap—m = 

The Hamiltonian form of mechanics can be preserved by re¬ 
placing all quantities by the corresponding matrices. In par¬ 
ticular the determination of stationary states can be reduced 
to finding matrices q,p for which the Hamiltonian H{p^q) 
as a matrix has only diagonal elements which are then the energy- 
levels of the states. In order to obtain the connexion with 
Planck’s theory of radiation, the squares \q{m,n)\^ have to be 
interpreted as Einstein’s coefficients In this way a few 

simple examples could be satisfactorily treated. Eut matrix 
mechanics applies obviously only to closed systems with discrete 
energy-levels, not to free particles and colhsion problems. 

This restriction was removed by SchrOdinger’s wave mechan¬ 
ics which sprang quite independently from an idea of de Broglie 


A 

27r 




(8.30) 
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about the application of quantum theory to free particles. It is 
widely held that de Broglie’s work is a striking example of the 
power of the human mind to find natural laws by pure reason, 
without recourse to obser’c^ation. I have not taken part in the 
beginnings of wave mechanics, as I have in matrix mechanics, 
and cannot speak therefore from my own experience. Yet I 
think that not a single step would have been possible if some 
necessary foothold in facts had been missing. To deny this 
would mean to maintain that Planck’s discovery of the quantum 
and Einstein’s theory of relativity were products of pure think¬ 
ing. They were interpretations of facts of observation, solutions 
of riddles given by Nature—dijaicult riddles indeed, which only 
great thinkers could solve. 

De Broghe observed that in relativity the energy € of a particle 
is not a scalar, but the fourth component of a vector in space- 
time, whose other components represent the momentum p; on 
the other hand, the frequency v of a plane harmonic wave is also 
the fourth component of a space-time vector, whose other 
components represent the wave vector k (having the direction 
of the wave normal and the length A”^, where A is the wave¬ 
length). Now if Planck postulates that e == hv^ one is compelled 
to assume also p = ^k. Por light waves where Av = c, this had 
already been done by Einstein, who spoke of photons behaving 
like darts with the momentum p — ejc — hvjc. De Broglie 
applied it to electrons where the relation between e and p is 
more complicated, namely obtained from (8.3) by eliminating 
the velocity v: , v 2 

(c) (8.31) 

If a particle (€,p) is always accompanied by a wave (v, k) the 
phase velocity of the wave would be (using e = mc^, p = mv) 

vX — vjh = ejp — c^jv ^ c, (8.32) 

impossible result, as the principle of relativity 
excludes velocities larger than that of light. But de Broglie 
was not deterred by this; he observed that the prohibition of 
velocities larger than c refers only to such motions which can be 
used for sending time-signals. That is impossible by means 
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of a monochromatic wave. For a signal one must have a small 
group of waves, the velocity of which can he obtained, according 
to Bayleigh, by differentiating frequency with respect to wave 
number. Thus, from (8.31) and (8.32),f 


dv 


d& 

dp 


pc* 




(8.33) 


a most satisfactory result which completely justifies the formal 
connexion of particles and waves, though the physical meaning 
of this connexion was still mysterious. 

This reasoning is indeed a stroke of genius, yet not a triumph 
of a priori principles, but of an extraordinary capacity for com¬ 
bining and unifying remote subjects. 

I should say the same about the work of Schrodinger and Dirac, 
but you could better ask them directly what they think about the 
roots of their discoveries. I shall not describe them here in detail, 
but indicate some threads to other facts or theories. SchrOdinger 
says that he was stimulated by a remark of de Broglie that any 
periodic motion of an electron must correspond to a whole 
number of waves of the corresponding wave motion. This led 
him to his wave equation whose eigenvalues are the energy- 
levels of stationary states. He was further guided by the analogy 
of mechanics and optics known from Hamilton’s investigations; 
the relation of wave mechanics to ordinary mechanics is the 
same as that of undulating optics to geometrical optics. Then, 
looking out for a connexion of wave mechanics with matrix 
mechanics, Schrodinger recognized as the essential feature of a 
matrix that it represents a linear operator acting on a vector 
(one-column matrix), and came in this way to his operator cal¬ 
culus (see Appendix, 27); if a coordinate q is taken as an ordinary 
variable and the corresponding momentum as the operator 

n d 

~ idq’ 

the commutation law (8.30) becomes a trivial identity. Apply¬ 
ing the theory of sets of ortho-normal functions, he could then 
establish the exact relation between matrix and wave mechanics. 


t See Atomic Physics, Ch. IV- 5, p. 84; A. XI, p. 295. 
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It is most remarkable that the whole story has been developed 
by Dirac from Heisenberg’s first idea by an independent and 
formally more general method based on the abstract concept of 
non-commuting quantities (g-numbers). 

The growth of quantum mechanics out of three independent 
roots uniting to a single trunk is strong evidence for the inevita¬ 
bility of its concepts in view of the experimental situation. 

From the standpoint of these lectures on cause and chance it 
is not the formalism of quantum mechanics but its interpretation 
which is of importance. Yet the formalism came first, and was 
well secured before it became clear what it really meant: nothing 
more or less than a complete turning away from the predomi¬ 
nance of cause (in the traditional sense, meaning essentially 
determinism) to the predominance of chance. 

This revolution of outlook goes back to a tentative interpreta¬ 
tion which Einstein gave of the coexistence of light waves and 
photons. He spoke of the waves being a 'ghost field’ which has 
no ordinary physical meaning but whose intensity determines 
the probability of the appearance of photons. This idea could be 
transferred to the relation of electrons (and of material particles 
in general) to de Broghe’s waves. With the help of SchrOdinger’s 
wave equation, the scattering of particles by obstacles, the excita¬ 
tion laws of atoms under electron bombardment, and other 
similar phenomena could be calculated with results which con¬ 
firmed the assumption. 

I shall now describe the present situation of the theory in a 
formulation due to Dirac which is well adapted to comparing the 
new statistical physics with the old deterministic one. 
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QUANTUM MECHANICS 

In quantuin mechanics physical quantities or observables are 
not represented by ordinary variables, but by symbols which 
have no numerical values but determine the possible values of 
the observable in a definite way to be described presently. These 
symbols can be added and multiplied with the proviso that 
multiplication is non-commutative: is in general different 

from BA. I cannot deal with the most general aspect of this 
symbolic calculus, but shall consider a special representation, 
namely that where the coordinates of the particles are 

regarded as ordinary numbers. Then a definite state of a system 
is defined by a function Q'Od an observable A can be 

represented by a linear operator: Aijj{q) means a new function 
the result of operating with A on j//. If this result is, apart 
from a factor, identical with 

AiIj -= ail,, (9.1) 

^ is called an eigenfunction of A tind the constant a an eigenvalue. 
The whole sot of eigenvalues is characteristic for the operator 
A and represents the possible numerical values of the observable, 
which may be continuous or* discontimious. 

The coordiiiates q themselves can be conHi<lered to be opera¬ 
tors, namely umltijdication operators: q^ operating on ij, 
means multiplying i/j by r/^^. Operators whose eigenvalues are 
all real numbers are called real (or ‘Herrnitiari’) operators. It 
is clear that all physical ciuantitio.s have to bo represented by real 
operators, as the eigenvalues are supposed to represent the pos¬ 
sible results of nieasuring a physical quantity. One can easily 
see that not only the multiplication operators but also the 

// d 

momenta — --— are real. But for the formal argument one 

can also use complc^x operators, of the form C — A~\-iB (where 
i V—i), and its conjugate =--- A — iJB; then OC'^' can bo 
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sh-own. to be a real operator with only positive (or zero) eigen¬ 
values. 

If two observables are represented by non-commuting opera¬ 
tors, A and JB, their eigenfunctions are not all identical; if a is 
an eigenvalue of A belonging to such an eigenfunction, there is 
no state of the system for which a measurement can result in 
finding simultaneously for A and JB sharp numerical values d 
and b. 

The theory cannot therefore in general predict definite 
values of all physical properties, but only probability laws. 
The same experiment, repeated under identical and controllable 
conditions, may result in finding for a quantity A so many 
times Oj, so many times aQ, etc., and for ^ in the same way b^ 
or 62 J But the average of repeated measurements must be 
predictable. Whatever the rule for constructing the number 
which represents the average A of the measurements of A, it 
must, by common sense, have the properties that A-j-JB = A-j-IB 
and cA = cA, if c is any number. 

From this alone there follows an important result. Consider 
apart from the averages A, JB of two operators A, JB also their 
mean square deviations, or the ^spreadings’ of the measurements, 

BA = VP“-Z)2}, 8B = (9.2) 

then by a simple algebraic reasoning (see Appendix, 28), which 
uses nothing other than the fact stated above that ( 7 ( 7 * has no 
negative eigenvalues, hence CU^ > 0 , it is found that 

(9.3) 

where \A,B]^ L{^aB—BA) (9.4) 

^ the so-caUed "commutator’ of the two operators A, B. If this 
is specially applied to a coordinate and its momentum, A ~ p, 
B = q, one has [g'jp] = 1 , therefore 

Sp.Sg'>|. (9,5) 

This is Heisenberg s celebrated uncertainty principle which is 
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a quantitative expression for the effect of non-commutation on 
measurements, but independent of the exact definition of aver¬ 
ages. It shows how a narrowing of the range for the measured 
g-values widens the range for The same holds, according to 
(9-3), for any two non-commuting observables with the differ¬ 
ence that the "uncertainty’ depends on the mean of the commu¬ 
tator. 

These general considerations are, so to speak, the kinematical 
part of quantum mechanics. Now we turn to the dynamical 
part. 

Just as in classical mechanics, the dynamical behaviour of a 
system of particles is described by a Hamiltonian 

which is a (differential) operator. It is usually just taken over 
from classical mechanics (where, if necessary, products fike 
have to be "symmetrized’ into ^{^q-\-qp))- In Dirac’s relativistic 
theory of the electron there are, apart from the space coordinates, 
observables representing the spin (and similar quantities in 
meson theory); they lead to no fundamental difficulty and will 
not be considered here. 

Yet one remark about the Hamiltonian H has to be made, 
bearing on our general theme of cause and chance: H contains 
in the potential energy (and in corresponding electromagnetic 
interaction terms) the last vestiges of Newton’s conception of 
force, or, using the traditional expression, of causation. We 
have to remember this point later. 

In classical mechanics we have used a formulation of the laws 
of motion which applies just as well to a simple system, where all 
details of the motion are of interest, as to a system of numerous 
particles, where only statistical results are desired (and possible). 
A function/( j 5, g) of time and of all coordinates and momenta 
was considered; iip,q change with time according to the equa¬ 
tions of motion, the total change of f is given by 
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where [-ff,/] is the Poisson bracket 


TH n = 

L ’•'J 4^\dqjc^Pic ^Pic^qJ' 


(9.7) 


One recovers the canonical equations by taking for f 
simply qj^ or respectively. On the other hand, if one puts 
dfjdt = 0, any solution of this equation is an integral of tlie 
equations of motion, and from a sufficient number of such 
integralsg) == Cj^ one can obtain the complete solution 
giving all p, q as functions of t. 

But if this is not required, the same equation is also the means 
for obtaining statistical information in terms of a solution /, 
called the ‘distribution function’, as I have described in detail. 
/ is that integral of ^» 

f = [HJl (9.8) 


which for if == 0 goes over into a given initial distribution/()(^>, q). 
If, in particular, this latter function vanishes except in the 
neighbourhood of a given point q^ in phase space, or, in Dirac 

notation, if /q = — PQ)^ig[ — q^), one falls back to the case of 

complete knowledge, q^ andp^ being the initial values ofq and p. 

This procedure cannot be transferred without alteration to 
quantum mechanics for the simple reason that p and q cannot 
be simultaneously given fixed values. The uncertainty relation 
(9.5) forbids the prescribing of sharp initial values for all /> and q. 
Hence the first part of the programme, namely a completes 
owledge of the motion in the same sense as in classical mechan¬ 
ics, breaks down right from the beginning. Yet the second part, 
statistical prediction, remains possible. Pollowing Dirac, we 
ask which quantities have to replace the Poisson brackets (9.7) 
quantum theory, where all quantities are in general non- 
commuting These brackets [a.^] have a number of algobruic 
properties; the most important of them being 

o 


(9.9) 


[a, ^1+^2] = [a,A] + [a,yS2], 

[“» A ^2] = ^2] + 

* 0 ^* tliese shall hold also for non-commuting; 
quantities a and ft provided the order of factors is always 
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)reserved (as it is in (9.9)), then it can be shown (Appendix, 29) 
bat [q:,^] is exactly the commutator as defined by (9.4). 

Now one has to replace the function / in (9.8) by a time- 
Lependent operator p, called the statistical operator, and to 
determine p from the equation (formally identical with (9.8)): 

^ = (9-10) 

v^ith suitable initial conditions. To express these in a simple 
^^ay it is convenient to represent all operators by matrices in 
^'■space; A operating on a function ip{q) is defined by 

= J dq' (9-11) 

stands for all coordinates S'!, and q' for another set of 

values where A{q,q') is called the matrix represent¬ 

ing A, 

The product A..S is represented by the matrix 

AB(q,q') = J A(q,q')B(q'',q’) dq\ (9.12) 

-f now p and H are taken as such matrices, where the elements 
)f p depend also on time, (9.10) is a differential equation for 
and the initial conditions are simply 


= po(q,q'), 


(9.13) 


vhere p^ is a given function of the two sets of variables. 

The number of vector arguments in p for a system of N par- 
}icles is 2N, exactly as in the case of classical theory in the func- 
non f{p,q). But while the meaning of / depending on p,q is 
jbvious, that of p depending on two sets q,q' is not, except in 
)ne case, namely when the two sets are identical, q — q'; then 


}he function 


p{i> q, q) = Mi, q) 


(9.14) 


s the number density, corresponding to the classical 

J fii,q,P)dp = n{t,q). 

^uite generally, the classical operation of integrating over then’s 
s replaced by the simpler operation of equating the two sets 
>f g’s, g = g', or in matrix language, taking the diagonal elements 
>f p. 

6131 


H 
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TJie average of an observable A for a configuration q must be 
a real number A formed from p and A so that nA is linear in both, 
operators. The simplest expression of this kind is 

nA = i(pA-hAp)^^^^, (9.15) 

and this gives, in fact, all results of quantum mechanics usually 
obtained with the help of the wave function. For instance, the 
statistical matrix describing a stationary state where A has a 
sharp value a, belonging to the eigenfunction is 

p = zff(a,q)ip*(a,q'). (9.16) 

Then, from the definition (9.12) it follows easily that for this p 
and any real operator A one has 

Ap = pA = Cbp, (9.17) 

hence for q — q\ with (9.14), 

n{a,q) = \ilj{a,q)\\ A = a. (9.18) 

Thus we have obtained the usual assumption that g^)]® 

is the ‘probability’ (if normalized to 1) or ‘number density^ 
(if normalized to N) at the point q for the state a. (It must, 
however, be noted that for systems of numerous particles, like 
liquids in motion, other ways of averaging are useful, for instance 
for the square of a momentum instead ofri^ — ^{pp^-\-p^p)^^^ 
the expression i{pp^-\-p^p~{-2ppp)^^^, which, however, for uni¬ 
form conditions coincides with the former.) 

Let us consider the general stationary case where p is inde¬ 
pendent of time and therefore satisfies 

[H.pI^O, (9.19) 

Any solution of this equation, i.e. any quantity A which com¬ 
mutes with H, is called an integral of the motion, in analogy to 
the corresponding classical conception. B itself is, of course, an 
integral. AU integrals A^, Ag,..., have different eigenvalues 
Ai, Aa,..., for one and the same eigenfunction */f(Ai, Ag,...; g,, q^,.,A, 
or shortly ^(A, q) : 

Ai^ = Ai^, ... . (9.20) 

P can be taken as any function of the A’s; its matrix representa¬ 
tion is given by 


p(g. 2') = 2; q'), 


(9.21) 



99 


QUANTUM MECHANICS 
from 'which, one obtains, with (9.18), 

<g.) = p{q,q) == 2 P{\M{Kq)? = 2 Pi^)MKq)^ ( 9 . 22 ) 

A A 

This shows that the arbitrary coefficient P(X) is the probability 
of finding the system in the stationary state A. 

Dynamical problems arise in a somewhat different way from 
those in classical theory. There it has a definite meaning to 
speak about the motion of particles in a closed system, for 
instance of the orbit of Jupiter in the planetary system. In 
quantum theory a closed system settles do’wn in a definite 
stationary state, or a mixture of such states as given by (9.21). 
But then nothing is changing in time; one cannot even make an 
observation without interfering with the state of the system. 
In classical physios it is supposed that we have to do with an 
objective and always observable situation; the process of measur¬ 
ing is assumed to have no influence on the object of observation. 
I have, however, drawn your attention to the point that even 
in classical physics this postulate is practically never fulfiJled 
because of the Brownian motion which affects the instruments. 
We are therefore quite prepared to find that the assumption of 
‘harmless’ observations is impossible. 

The most general way of formulating a dynamical problem is 
to split the Hamiltonian in two parts 

H = Ho+K (9.28) 

where Hq describes what is of interest while V is of minor impor¬ 
tance, a so-called perturbation. V may also include external 
influences and depend explicitly on the time. This partition is, 
of course, arbitrary to a high degree; but it corresponds to the 
actual situation. If a water molecule HgO is assembled from its 
atoms, one can either ask what the stationary states of the whole 
system are, or one can consider the parts Hg and O and ask how 
the states of the hydrogen molecule Hg are changed by the 
approaching oxygen atom, or one can ask the same question 
for the HO radical and the H atom. The latter two are dynamical 
problems. 

Dynamical problems in quantum theory therefore, in contrast 
to thosein classical theory, cannot be defined without a subjective, 
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more or less arbitrary decision about what you are interes 
in. In other words, quantum mechanics does not describe 
objective state in an independent external world, but the asj 
of this world gained by considering it from a certain subjeo* 
standpoint, or with certain experimental means and arrai 
ments. This statement has produced much controversy, ; 
though it is generally accepted by the present generiitior 
physicists it has been decidedly rejected by just those two i 
who have done more for the creation of quantum physics tl 
anybody else, Planck and Einstein. Yet, with all rcspeci 
cannot agree with them. In fact, the assumption of absol 
observability which is the root of the classical concepts seems 
me only to exist in imagination, as a postulate which cannoi 
satisfied in reality. 

Assu min g the partition (9.23) one has to describe the sy ston 
terms of the integrals of motion A^, Ag,... of which are, h< 

ever, not integrals of motion of JEL. All operators are then to 
expressed as matrices in the eigenvalues A (Aj, Ag,...) of A^, 
for instance, the statistical operator p by the matrix pit\ A, 
The diagonal elements of this matrix 


P(^;A) == A,A) (<). 

represent the probability of a state A at time and they go o 
^ ^ coefficients P(A) which appear in the expans 

represent the initial probabilities. The funct 
P{t; A, A ) can be determined from the differential equation (9. 
y a inethod of successive approximations. My collaboriit 
r^n, as even found an elegant formula representing the co 
plete solution. To a second approximation one finds 


P(«,A) = P(A)+2 ^(A,A'){P(A')-P(A)}H-...; 

A 


(9.; 


tke coefficients are given by 


_ J, 


t 


(9.1 


stated- 

unaii m tne state A (see Appendix, 30). 
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Now equation (9.25) has precisely the form of the laws of radio¬ 
active decay, or of a set of competing mono-molecular reactions. 
The matrix J(A, A') obviously represents the probability of a 
transition or jump from the state A to the state A'. This inter¬ 
pretation becomes still more evident if one assumes that the 
A-values are practically continuous, as would be the case if the 
system allowed particles to fly freely about (for instance in radio¬ 
activity one has to take account of the emitted a-particles; in the 
theory of optical properties of an atom of the photons emitted 
and absorbed). If external influences are excluded, so that F 
does not depend on time, the integral (9.26) can be worked out 
with the result that J becomes proportional to the time 

J-(A,A') = j(A,A')«, (9.27) 

where j{\,X) = (9.28) 

n 

The last factor ^{E — E') says that j{X,X') differs from zero for 
two states A and A^ only if their energy is equal. j'(X, A') is 
obviously the transition probability per unit time, precisely 
the quantity used in radioactivity. 

By applying the formula (9.25) to the case of the interaction 
of an atom with an electromagnetic field one obtains the formula 
(8.24) which was used by Einstein in his derivation of Planck’s 
radiation law. There are innumerable similar applications, such 
as the calculation of the effective cross-sections of various kinds 
of collision processes, which have provided ample confirmation 
of the formula (9.25). 

INDETERMINISTIC PHYSICS 

There is no doubt that the formalism of quantum mechanics 
and its statistical interpretation are extremely successful in 
ordering and predicting physical experiences. But can our 
desire of understanding, our wish to explain things, be satisfied 
by a theory which is frankly and shamelessly statistical and 
indeterministic ? Can we be content with accepting chance, not 
cause, as the supreme law of the physical world ? 

To this last question I answer that not causahty, properly 
understood, is eliminated, but only a traditional interpretation 
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of it, consisting in its identification with determinism. I haTe 
taken pains to show that these two concepts are not identical. 
Cansality in my definition is the postulate that one physical 
situation depends on the other, and causal research means the 
discovery of such dependence. This is still true in quantum 
physics, though the objects of observation for which a depen¬ 
dence is claimed are different: they are the probabilities of 
elementary events, not those single events themselves. 

In fact, the statistical matrix p, from which these probabilities 
are derived, satisfied a differential equation which is essentially 
of the same type as the classical field equations for elastic or 
electromagnetic waves. T'or instance, if one multiplies the 
eigenfunction 4,(q) of the Hamiltonian H, Ht/j = EiIj, by 
the new function satisfies 


7id<f> 

i~^ 



(9.29) 


Hor a free particle, where H 


2m 


(Px+Py+Pl) 




2 m 


A, (9.29) 


(9.30) 


goes over tiie wave equation 

Although here only the first derivative with respect to time 
if' does not differ essentially from the ordinary wave 

equation J where the left-hand side is \ ~^\. One must remember 

\ c2 0^2 j 

that only = [,^j2 a physical meaning (as a probability), 
where satisfies the conjugate complex equation 

2mi 8(f> * 


^ dt 


A<f> 


Tor this pair of equations a change in the time direction (t —— t) 

can be compensated by exchanging ^ and which has no in¬ 
fluence on 

The same holds in the general ease (9.29), and we see that the 
differential equations of the wave function share the property 
of all classical field equations that the principle of antecedence 
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is violated: there is no distinction between past and future for 
the spreading of the probability density. On the other hand, the 
principle of contiguity is obviously satisfied. 

The differential equation itself is constructed in a way very 
similar to the classical equations of motion. It contains in the 
potential energy, which is part of the Hamiltonian, the classical 
idea of force, or in other words, the Newtonian quantitative 
expression for causation. If, for instance, particles are acting 
on one another with a Coulomb force (as the nucleus and the 
electrons in an atom), there appears in H the same timeless action 
over finite distance as in Newtonian mechanics. Yet one has the 
feeling that these vestiges of classical causality are provisional 
and will be replaced in a future theory by something more satis¬ 
factory; in fact, the difficulties which the application of quantum 
mechanics to elementary particles encounters are connected with 
the interaction terms in the Hamiltonian; they are obviously stiU 
too "'classical’. But these questions are outside the scope of my 
lectures. 

We have the paradoxical situation that observable events obey 
laws of chance, but that the probability for these events itself 
spreads according to laws which are in all essential features 
causal laws. 

Here the question of reality cannot be avoided. What really 
are those particles which, as it is often said, can just as well 
appear as waves ? It would lead me far from my subject to discuss 
this very difficult problem. I think that the concept of reahty is 
too much connected with emotions to allow a generally accept¬ 
able definition. For most people the real things are those things 
which are important for them. The reality of an artist or a poet 
is not comparable with that of a saint or prophet, nor with that 
of a business man or administrator, nor with that of the natural 
philosopher or scientist. So let me chng to the latter kind of 
special reality, which can be described in fairly precise terms. 
It presupposes that our sense impressions are not a permanent 
hallucination, but the indications of, or signals from, an external 
world which exists independently of us. Although these signals 
change and move in a most bewildering way, we are awaxe of 
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objects with invariant properties. The set of these invariants of 
our sense impressions is the physical reahty which our mind 
constructs in a perfectly unconscious way. This chair here looks 
chtterent with each movement of my head, each twinkle of my 
eye, yet I perceive it as the same chair. Science is nothing else 
than the endeavour to construct these invariants where they are 
not obvious. If you are not a trained scientist and look through 
a ^croscope you see nothing other than specks of light and 
colour, not objects; you have to apply the technique of biological 
science, consistmg in altering conditions, observing correlations, 
etc., to learn that what you see is a tissue with cancer cells, or 
something like that. The words denoting things are applied to 
permanent features of observation or observational invariants. 

n p ysics this method has been made precise by using mathe¬ 
matics. ^ere the invariant against transformation is an exact 

Klein in his celebrated Erlanger JProgrcimm has 
classified the whole of mathematics according to this idea, and 
tne same could be done for physics. 

From this standpoint I maintain that the particles are real, 
as they represent invariants of observation. We believe in the 
ezastence’ of the electron because it has a definite charge e and a 
efimte mass m and a definite spin 5 ; that means in whatever 
cmcumstances and experimental conditions you observe an effect 
which theory ascribes to the presence of electrons you find for 

same numerical values. 

account of these results, imagine 
e ec ron e a tiny grain of sand, having a definite position in 

theor^ another matter. In fact you can, even in quantum 

cannot do is to suppose it also to have a 
We velocity at the same time; that is impossible according to 
the uncertamty relation. Though in our everyday experience 

waca^asenbeto ordinaryboaesdefiuitepositionsaudvelocities! 

for dimensions which are 

th^° invariants of observation. But 

Smrso? I ^ we must use 

m as soon as we have made up our minds to describe certain 
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phenomena in terms of particles. Bohr has stressed the point that 
our language is adapted to our intuitional concepts. We cannot 
avoid using these even where they fail to have all the properties 
of ordinary experience. Though an electron does not behave like 
a grain of sand in every respect, it has enough invariant pro¬ 
perties to be regarded as just as real. 

The fact expressed by the uncertainty relation was first dis¬ 
covered by interpreting the formalism of the theory. An 
explanation appealing to intuition was given afterwards, namely 
that the laws of nature themselves prohibit the measurement 
with infinite accuracy because of the atomic structure of matter: 
the most delicate instruments of observation are atoms or 
photons or electrons, hence of the same order of magnitude as 
the objects observed. Niels Bohr has applied this idea with great 
success to illustrate the restrictions on simultaneous measure¬ 
ments of quantities subject to an uncertainty rule, which he calls 
‘ complementary ’ quantities. 

One can describe one and the same experimental situation 
about particles either in terms of accurate positions or in terms 
of accurate momenta, but not both at the same time. The two 
descriptions are complementary for a complete intuitive under¬ 
standing. You find these things explained in many text-books 
so that I need not dwell upon them. 

The adjective complementary is sometimes also appHed to 
the particle aspect and the wave aspect of phenomena—I think 
quite wrongly. One can call these ‘dual aspects’ and speak of a 
‘duality’ of description, but there is nothing complementary as 
both pictures are necessary for every real quantum phenomenon. 
Only in limiting cases is an interpretation using particles alone 
or waves alone possible. The particle case is that of classical 
mechanics and is applicable only to the case of large masses, 
e.g. to the centre of mass of an almost closed system. The wave 
case is that of very large numbers of independent particles, as 
illustrated by ordinary optics. 

The question of whether the waves are something ‘real’ or a 
fiction to describe and predict phenomena in a convenient way 
is a matter of taste. I personally like to regard a probability 
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wave, even in 3iV-dimensional space, as a real thing, certainly 
as more than a tool for mathematical calculations. For it has 
the character of an invariant of observation; that means it 
predicts the results of counting experiments, and we expect to 
find the same average numbers, the same mean deviations, etc., 
if we actually perform the experiment many times under the 
same experimental condition. Quite generally, how could we 
rely on probability predictions if by this notion we do not refer 
to something real and objective ? This consideration applies just 
as much to the classical distribution function /(i; ^5, O') as to the 
quantum-mechanical density matrix p{t; q,q'). 

The difference between / and p lies only in the law of propaga¬ 
tion, a difference which can be described as analogous to that 
between geometrical optics and undulatory optics. In the latter 
case there is the possibility of interference. The eigenfunctions 
of quantum mechanics can be superposed like light waves and 
produce what is often called ‘interference of probability’. 



B 

Fig. 2. 

This leads sometimes to puzzling situations if one tries to ex¬ 
press the observations only in terms of particles. Simple optical 
experiments can be used as examples. Assume a source A. of 
light iHuminating a screen jB with two slits and the 

light penetrating these observed on a parallel screen C. If only 
one of the slits is open, one sees a diffraction pattern around 
the point where the straight line AJB-^ hits the screen, with a 
bright central maximum surrounded by smaU fringes. When 
both slits are open and the central maxima of the diffraction 
pattern overlap, there appear in this region new interference 
fringes, depending on the distance of the two slits. 
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The intensity, i.e. the probability of finding photons on the 
screen, in the case of both slits open, is therefore not a simple 
superposition of those obtained when only one of the slits is 
open. This is at once understandable if you use the picture of 
probability waves determining the appearance of photons. For 
the spreading of the waves depends on the whole arrangement, 
and there is no miracle in the effect of shutting one slit. Yet if 
you try to use the particles alone you get into trouble; for then a 
particle must have passed one slit or the other and it is perfectly 
mysterious how a slit at a finite distance can have an influence on 
the diffraction pattern. Heichenbach, who has published a very 
thorough book on the philosophical foundations of quantum 
mechanics, speaks in such cases of ‘causal anomalies’. To avoid 
the perplexity produced by them he distinguishes between 
phenomena, i.e. things really observable, such as the appearance 
of the photons on the screen, and ‘inter-phenomena’, i.e. theoreti¬ 
cal constructions about what has happened to a photon on its 
way, whether it has passed through one slit or the other. He 
states rightly that the difficulties arise only from discussing 
inter-phenomena. ‘That a photon has passed through the slit 
is meaningless as a statement of a physical fact.’ If we want 
to make it a physical fact we have to change the arrangement 
in such a way that the passing of a photon through the slit B^ can 
be really registered; but then it would not fly on undisturbed, 
and the phenomenon on the screen would be changed. Reichen- 
bach’s whole book is devoted to the discussion of this type of 
difficulty. I agree with many of his discussions, though I object 
to others. For instance, he treats the interference phenomenon 
of two slits also in what he calls the wave interpretation; but here 
he seems to me to have misunderstood the optical question. In 
order to formulate the permitted and forbidden (or meaningless) 
statements he suggests the use of a three-valued logic, where the 
law of the ‘excluded middle’ {tertiuTn non datwr^ does not hold. 
I have the feeling that this goes too far. The problem is not one 
of logic or logistic but of common sense. For the mathematical 
theory, which is perfectly capable of accounting for the actual 
observations, makes use only of ordinary two-valued logics. 
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difficulties arise solely if one transcends actual observations and 
insists on using a special restricted range of intuitive images 
and corresponding terms. Most physicists prefer to adapt their 
imagination to the observations. Concerning the logical pro¬ 
blem itself, I had the impression when reading Reiclienbach’s 
book that in explaining three-valued logic ho constantly used 
ordinary logic. Thismay be avoidable or justifiable. I remember 
the days when I was in daily contact with Hilbert, who was 
working on the logical foundations of mathematics. Ho dis¬ 
tinguished two stages of logics: intuitive logic dealing with finite 
sets of statements, and formal logic (logistics), which he described 
as a game with meaningless symbols invented to deal with the 
infinite sets of mathematics avoiding contradictions (like that 
revealed in Russell’s paradox). But Godel showed that these 
contradictions crop up again, and Hilbert’s attempt is to-day 
generally considered a failure. I presume that three-valuod 
logic is another example of such a game with symbols. It is 
certainly entertaining, but I doubt that natural philosophy will 
gain much by playing it. 


Thinkmg in terms of quantum theory needs some eflort and 

considerable practice. The clue is the point which 1 have 

stressed above, that quantum mechanics does not describe a 

situation in an objective external world, but a definite experiinen- 

^1 arrangement for observing a section of the external world. 

it out this idea even the formulation of a dynamical problem 

m quantum theory is impossible. But if it is accepted, the fiinda- 

mentaJ mdeterminaoy in physical predictions becomes natural, 

arrangement can ever be absolutely precise. 

even the most fervent determinist cannot deny 

resea^^^^v mechanics has served us well in actual 

W ^ replaced 

bj a determimstic theory of the classical type. 

re^duira^e ^ *1^® °Ii^nces of such a coun tor- 

return to determiLm possibility of a 

experiences ma^^e us to 

y to alterations and even reversions. Yot 
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scanning the history of physics in the way we have done we see 
fluctuations and vacillations, but hardly a reversion to more 
primitive conceptions. I expect that our present theory will be 
profoundly modified. For it is full of difficulties which I have not 
mentioned at all—the self-energies of particles in interaction 
and many other quantities, hke collision cross-sections, lead to 
divergent integrals. But I should never expect that these 
difficulties could be solved by a return to classical concepts. I 
expect just the opposite, that we shall have to sacrifice some 
current ideas and tise still more abstract methods. However, 
these are only opinions. A more concrete contribution to this 
question has been made by J. v. Neumann in his brilhant book, 
Mathematische Grundlagen der Quantenmechanik. He puts the 
theory on an axiomatic basis by deriving it from a few postulates 
of a very plausible and general character, about the properties 
of ‘expectation values’ (averages) and their representation by 
mathematical symbols. The result is that the formalism of 
quantum mechanics is uniquely determined by these axioms; in 
particular, no concealed parameters can be introduced with the 
help of which the indeterministic description could be trans¬ 
formed into a deterministic one. Hence if a future theory 
should be deterministic, it cannot be a modification of the present 
one but must be essentially different. How this should be 
possible without sacrificing a whole treasure of well-established 
results I leave to the determinists to worry about. 

I for my part do not believe in the possibility of such a turn 
of things. Though I am very much aware of the shortcomings 
of quantum mechanics, I think that its indeterministic founda¬ 
tions will be pei’manent, and this is what interests us from the 
standpoint of these lectures on cause and chance. There remains 
now only to show how the ordinary, apparently deterministic 
laws of physics can be obtained from these foundations. 


QUANTUM KINETIC THEORY OF MATTER 

The main problem of the classical kinetic theory of matter 
was how to reconcile the reversibility of the mechanical motion 
of the ultimate particles with the irreversibility of the thermo- 
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dynamical laws of matter in bulk. This was achieved by pro¬ 
claiming a distinction between the true laws which are strictly 
deterministic and reversible but of no use for us poor mortals 
with our restricted means of observation and experimentation, 
and the apparent laws which are the result of our ignorance and 
obtained by a deliberate act of averaging, a kind of fraud or 
falsification from the rigorous standpoint of determinism. 

Quantum theory can appear with a cleaner conscience. It 
has no deterministic bias and is statistical throughout. It has 
accepted partial ignorance already on a lower level and need 
not doctor the final laws. 

In order to define a dynamical phenomenon one has, as we ha ve 
seen, to spht the system in two parts, one being the interesting 
one, the other a ‘perturbation’; and this separation is highly 
arbitrary and adaptable to the experimental arrangement to 
described. Now this circumstance can be exploited for the pro¬ 
blem of thermodynamics. There one considers two (or more) 
bodies first separated and in equilibrium, then brought into 

contact and left to themselves until equihbrium is again 
attained. 

Let Sm be the Hamiltonian of the first body, that of the 
second, and write 




Then this is the combined Hamiltonian of the separated bodies. 

H they are brought into contact the Hamiltonian wUl bo differ- 

ent^ namely 

^ = ^o+K (9.32) 

^Mkt^f ordinary matter in bulk will 

u. XXI and. that is just the case, 
the the behaviour of the combined system by 

grak o/morn " T/SI1^?“’ "" 

of motion A<^) A<^) integrals 

f°r^themtegr;k;f;;;«„,,„f all together 

ne separated bodies, represented 
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by Hq. Hence we can use the solution of the dynamical problem 
given before, namely (9.25), 

P(j5,A) = P(A)+ 2 *^(A,A'){P(A')~P(A)}+..., (9.33) 

A' 

where now A represents the sets of eigenvalues == (A^^^, A^^^,...) 
of Ai^), and A^^) = (A^^), A^^),...) of Ai^), 

Let us consider first statistical equilibrium. Then 

P(i, A) = P(A); 

hence the sum must vanish, and one must have 

P(A') == P(A) (9.34) 

for any two states A, A' for which the transition probability 
J{X,X) is not zero. But we have seen further that these quan¬ 
tities J (A, A') are in all practical cases proportional to the time 
and vanish unless the energy is conserved, B — E' (see formulae 
9.27,9.28). If we disregard cases where other constants of motion 
exist for which a conservation law holds (like angular momentum 
for systems free to rotate), one can replace P(A) by P{E), But 
as the total system consists of two parts which are practically 
independent, one has 

P(P) = P(A) = Pi(Aa))P2(A<2)), (9.36) 

where the two factors represent the probabilities of finding the 
separated parts initially in the states A^^^ and A<^\ This factoriza¬ 
tion need not be taken from the axioms of the calculus of pro¬ 
bability; it is a consequence of quantum mechanics itself. For 
if the energy is a sum of the form (9.31), the exact solution of the 
fundamental equation for the density operator 

% = [A P] (9-36) 

is p = PxP'iy where refers to the first system pa to the 
second as according to (9.24) P{t, A) = p{t\ A, A), the product 
formula (9.35) holds not only for the stationary case (as long as 
the interactions can be neglected). If now P^^(A<’^^) and P^>(A^^^) 
are the energies of the separated parts, one obtains from (9.35) 

P(p(i)4.p(2 )) ^ P3^(A<i))P2(A(2)), (9.37) 
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which is a functional equation for the three functions jP, 

The solution is easily found to he (see Appendix, 31) 

P = Pi = Pg = (9.38) 

with a = ai-^-ocg? P ~ P 1 +P 2 (9.39) 


and the same ^ in all three expressions. 

Thus we have found again the canonical distribution of Oibbs, 
with the modification that the energies appearing are not explicit 
functions of q and p (Hamiltonians) but of the eigenvalues 
A, A^2^ of the integrals of motion. 

This derivation is obviously a direct descendant of Maxwell’s 
first proof of his velocity distribution law which we discussed 
previously, p. 51. But while the argument of independence is not 
justifiable with regard to the three components of velocity, it is 
perfectly legitimate for the constants of motion A. The fact 
that the multiplication law of probabilities and the additivity 
of energies for independent systems leads to the exponential 
distribution law has, of course, been noticed and used by many 
authors, beginning with Gibbs himself. This reasoning becomes, 
with the help of quantum mechanics, an exact proof which shows 
the limits of validity of the results. Bor if there exist constants 
of motion other than the energy, the distribution law lias to 
he modified, and therefore the whole of thermodynamics, '^rhis 
happens for instance for bodies moving freely in space, like stars, 
where the quantity ^ — IjlcT is no longer a scalar but the time 
component of a relativistic four-vector, the other components 
representing where v is the mean velocity of the body . Yet 
this is outside the scope of these lectures. 


The simplest and much discussed application of quantum 
statistics k that to the ideal gases. It was Einstein who first 

\ very low temperatures deviations from tho 

c assicaJ laws should appear. The Indian physicist, Bose, had 
• ^ one can obtain Planck’s law of radiation by regal’d- 

‘Ptoton gas’ provided one did not treat 
nlete]v°i°^ “^dividual recognizable particles but as corn- 

transferred this idea to 
ms. Later it was recognized that this so-called Bose- 
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Einstein statistics was a straightforward consequence of quan¬ 
tum mechanics; about the same time Eermi and Dirac discovered 
another similar case which applies to electrons and other particles 
with spin. 

In the language used here the two ‘statistics’ can be simply 
characterized by the symmetry of the density function 

Xa,..., x^; xi, Xg,..., x^). 

It is always symmetric, for indistinguishable particles, in both 
sets of arguments, i.e. it remains unchanged if both sets are 
subject to the same permutation. If, however, only one set is 
permuted, p remains also unchanged in the Bose—Einstein case 
for all permutations, while in the Fermi—Dirac case it does so 
only for even permutations, and changes sign for odd permuta¬ 
tions. 

Aj)phed to a system of free particles of equal structure, one 
obtains at once from the canonical distribution law the pro¬ 
perties of so-called degenerate gases. But as these are treated in 
many text-books, I shall not discuss them here (see Appendix, 
32). 

After having considered statistical equilibrium we have now 
to ask whether quantum mechanics accounts for the fact that 
every system approaches equilibrium in time by the dissipation 
of visible energy into heat, or, in other words, whether the 
jST-theorem of Boltzmann holds. 

This is the case indeed, and not difficult to prove. One defines 
the total entropy, just as in classical theory, by 

J P(^, A)log P{t, A) 

s = -fc-i , (9.40) 

A 

where the summations are to be extended over all values of the 
Ai, Ag,...; i.e. for each separate part of a coupled system over 
A^^^, A^^^,... and AJ^^^ A^^^..., respectively, and for the whole system 
over both sets. For loosely coupled systems the probabilities 
are, as we have seen, multiplicative at any time: 

P{t‘, AW, A(2)) = P^(«,AW)P2(^,AW). 

I 


61.31 


(9.41) 
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From this it follows easily that the entropies are additive, 

S = (9.42) 

N^ow substitute into (9.40) the explicit expression for P(^,A) 
from (9.33) which holds for weak coupling; then by neglecting 
higher powers of the small quantities J(A, A') one obtains 

, 2 y)Q{\ A') 

^ = . ( 9 . 43 ) 

A 

where ^(A,A') = {P(A)—P(A')}log-^^. (9.44) 

The transition probabilities J (A, A') are, as we have seen, in all 

practical cases proportional to time and vanish for transitions, 

for which energy is not conserved; one has, according to (9.27) 

and (9.28), ^ 

J(A,A') = ^^1F(A,A')12S(P—P'), (9.45) 

n 

where V is the interaction potential. These quantities J(A, A') 

are always positive. So is the denominator ^ P(A), while ^(A, A') 

A 

is positive as long as P(A) diflers from P(A'). 

Hence S increases with time and will continue to do so, until 
statistical equilibrium is reached. For only then no further 
increase of 8 will happen, as is seen by taking equilibrium as the 
initial state (where according to (9.34) Q{X,X') = 0 for all non¬ 
vanishing transitions). 

It remains now to investigate whether quantum kinetics 
leads, for matter in bulk, to the ordinary laws of motion and ther¬ 
mal conduction as formulated by Cauchy. This is indeed the 
case, as far as these laws are expressed in terms of stress, energy, 
and dux of matter and heat. Yet, as we have seen, this is only 
half the story, since Cauchy’s equations are rather void of mean¬ 
ing as long as the dependence of these quantities on strain, 
temperature, and the rate of their changes in space and time are 
not given. Now in these latter relations the difference between 
quantum theory and classical theory appears and can reach vast 
proportions under favourable circumstances, chiefly for low 
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temperatures. The theory sketched in the following is mainly 
due to my collaborator, Green. 

The formal method of obtaining the hydrothermal equations 
is very similar to that used in classical theory. Starting from the 
fundamental equation for N particles 

% = (9.46) 


a reduction process is apphed to obtain similar equations for 
N — 1, N — 2,..., particles, until the laws of motion for one particle 
are reached. 

The reduction consists, as in classical theory, in averaging 
over one, say the last, particle of a set. The coordinates of each 
particle appear twice as arguments of a matrix 




Put here x^”-^ = x^^^' and integrate over x^’^^; the result is Xn Pny ^ 
matrix which depends only on x<i>,..., xW',..,, x^"~i>'. 

With the same normalization as in classical theory, (6.40), p. 67, 
we write 

X«+i (9-47) 


By applying this operation several times to (9.46) one obtains 
(see Appendix, 33) 


^ pJ+S, (q = 1,2,..., N), (9.48) 

where = 2 x«+i Pa+i]. (9-49) 

in full analogy with the corresponding classical equations (6.44), 
(6.45). Here means the Hamiltonian of q particles, 
the interaction between one of these {i) and a further particle 
(g-f-l), and^Sg is called, as before, the statistical term. 

The quantity Pq(x, x) = -^^(x) represents the generalized 
number density for a ‘cluster’ of q particles, and in particular 
'y 2 -i(x) is the ordinary number density. 

Now one can obtain generalized hydro-thermodynamical 
equations from (9.47) by a similar process to that employed in 
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classical theory. Instead of integrating over tlie velocities one 
has to take the diagonal terms of the matrices (putting x = x'), 
and one has to take some precautions in regard to non-commnta- 
tivity by symmetrizing products, e.g. replacing by ^{oc^-\-poc) 
(see Appendix, 33). Exactly as in the classical equations of 
motion there appears the average kinetic energy of the particle 
(i) in a cluster of g particles which, divided by may be called 
a Mnetic temperature of the particle (^) in the cluster of q par¬ 
ticles. One might expect that the quantity corresponds to the 
ordinary temperature; but this is not the case. 

It is well known from simple examples (e.g. the harmonic 
oscillator) that in quantum theory for statistical equilibrium the 
thermodynamic temperature T, defined as the integrating de¬ 
nominator of entropy, is not equal to the mean square momen¬ 
tum. Here in the case of non-equilibrium it turns out that not 
only this happens, but that a similar deviation occurs with regard 
to pressure. The thermodynamical pressure p is defined as the 
work done by compression for unit change of volume; the kinetic 
pressure^j^is the isotropic part of the stress tensor in the equations 
of motion. These two quantities differ in quantum theory. 

Observable effects produced by this difference occur only for 
extremely low temperatures. Eor gases these are so low that 
they cannot be reached at all because condensation takes place 
long before. Most substances are sofid crystals in this region of 
temperature; for these one has a relatively simple quantum 
theory, initiated by Einstein, where the vibrating lattice is 
regarded as equivalent to a set of oscillators (the ‘normal modes ’). 
This theory represents the quantum effects in equilibrium 
(specific heat, thermal expansion) fairly well down to zero 
temperature, while the phenomena of flow are practically 
unobservable. 

There are only two cases where quantum phenomena of flow 
at very low temperatures are conspicuous. One is liquid helium 
which, owing to its small mass and weak cohesion, does not 
crystallize under normal pressure even for the lowest tempera¬ 
tures and becomes supra-fluid at about 2° absolute. The other 
case is that of the electrons in metals which, though not an 
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ordinary fluid, behave in many respects like one and, owing to 
their tiny mass, exhibit quantum properties, the strangest of 
which is supra-conductivity. 

In order to confirm the principles of quantum statistics, 
investigations of these two cases are of great interest. Both 
have been studied theoretically in my department in Edinburgh, 
and I wish to say a few words about our results. 

In the supra-fluid state helium behaves very differently from a 
normal liquid. It appears to lose its viscosity almost completely; 
it flows through capillaries or narrow slits with a fixed velocity 
almost independent of the pressure, creeps along the walls of the 
container, and so on. A metal in the supra-conductive state has, as 
the name says, no measurable electrical resistance and behaves 
abnormally in other ways. A common and very conspicuous 
feature of both phenomena is the sharpness of the transition 
point which is accompanied by an anomaly of the specific 
heat: it rises steeply if the temperature approaches the critical 
value jPg from below, and drops suddenly for T — 3^, so that the 
graph looks like the Greek letter A; hence the expression A-point 
for T^. However, this similarity cannot be very deeply rooted. 
Where has one to expect, from the theoretical standpoint, the 
beginning of quantum phenomena ? Evidently when the momen¬ 
tum p of the particles and some characteristic length I are reaching 
the limit stated by the uncertainty principle, pi ~ If we 
equate the kinetic energy p’^j^m to the thermal energy IcT, 
the critical temperature will be given by hT^ ^ If one 

substitutes here for k and H the well-known numerical values and 
for m the mass of a hydrogen atom times the atomic mass 
number /x, one finds, in degrees absolute, 

3 ; — ^, (9.50) 

where I is measured in Angstrom units (10~®cm.). 

Eor a helium atom one has p, = 4, and if I is the mean distance 
of two atoms (order 1 A) one obtains for a few degrees, which 
agrees with the observed transition at about 2°. But for elec¬ 
trons in metals one has p = 1/1840 . If one now assumes one 
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electron per atom and interprets I as their mean distance it would 
be again of the order 1 A, hence the expression (9.50) would 
become some thousand degrees and has therefore nothing to do 
with the A-point of supra-conductivity. This temperature has, 
in fact, another meaning; it is the so-called ‘degeneration tem¬ 
perature’ Tg of the electronic fluid; below Tg, for instance at 
ordinary temperatures, there are already strong deviations from 
classical behaviour (e.g. the extremely small contribution of the 
electrons to the specific heat), though not of the extreme charac¬ 
ter of supra-conductivity. In order to explain the A-point of 
supra-conductivity which lies for all metals at a few degrees 
absolute, one has to take Z about 200 times larger 200 A), 
As the interpretation of this length is still controversial, I shall 
not discuss supra-conductivity any further (see Appendix, 34). 

ISTor do I intend in the case of supra-fluidity of helium to give 
a full explanation of the A-discontinuity, but I wish to direct 
your attention to the thermo-mechanical properties of the 
supra-liquid below the A-point, called He II. 

I have already mentioned that in quantum liquids one has to 
distinguish the ordinary thermodynamic temperature T and 
pressure p from the kinetic temperature and pressure 
The hydro-thermal equations contain only and and these 
quantities are constant in equilibrium, i.e. for a state where no 
change in time takes place. But T-^ and p-^ are not simple 
functions of T and p but depend also on the velocity and its 
gradient. Therefore in such a state permanent currents of mass 
and of energy may flow as if no viscosity existed. This is reflected in 
the energy balance which can be derived from the hydro-thermal 
equations. One obtains a curious result which looks like a viola¬ 
tion of the flrst law of thermodynamics; for the change of heat 
is given by 

dQ = TdS = dU-{-pdV—Vd7T, (9.51) 

where all symbols have the usual meaning, and tt — pj^ — p is 
the difference of the kinetic and thermodynamic pressures. This 
equation differs from the ordinary thermodynamical expression 
(5.12) by the term — V drr; how is this possible if thermodynamics 
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claims rightly universal’ validity ? This claim is quite legitimate, 
but the usual form of the expression for dQ depends on the 
assumption that a quasi-static, i.e. very slow, process can be 
regarded as a sequence of equilibria each determined by the 
instantaneous values of pressure and volume. This is correct in 
the classical domain, because if the rate of change of external 
action (compression, heat supply, etc.) is slowed down, all veloci¬ 
ties in the fluid tend to disappear. Not so in quantum mechanics. 
In consequence of the indeterminacy condition the momenta or 
the velocities cannot decrease indefinitely if the coordinates of 
the particles are restricted to very small regions. An investiga¬ 
tion of the hydro-thermal equations shows that this effect is 
preserved, to some degree, even for the visible velocities; it is 
true there can exist a genuine statistical equilibrium where the 
density is uniform and the currents of mass and energy vanish, 
but there are also those states possible where certain combina¬ 
tions of currents of mass (velocities) and of energy (heat) per¬ 
manently exist. The production of these depends entirely on 
the way in which the heat dQx^ supplied to the system and cannot 
be suppressed by just making the rate of change of volume very 
small. We have therefore not a breakdown of the law of con¬ 
servation of energy but of its traditional thermodynamical 
formulation. 

The consequences of that extra term in (9.51) are easily seen 
by introducing instead of the internal energy the quantity 

M = U—ttV (9.52) 

in the expression (9.51) for dQ, which then reads 

dQ — dE-{-pj^dV, (9.53) 

4 

where == p-f-Tr is the kinetic pressure. This shows that the 
specific heat at constant volume is 



not (dU/dT)^, as in classical thermodynamics. Now as and 
therefore tt — p^ — p, is very large at T = 0 and decreases with 
increasing T to reach the value 0 at the A-point, one obtains for 
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c^{T) a curve exactly of the form actually observed. Hence the 
A-anomaly is due to the coupling of heat currents with the 
mass motion characteristic of quantum liquids. It is a molar, 
macroscopic motion, the shape of which depends on the geo¬ 
metrical conditions, presumably consisting of tiny closed threads 
of fast-moving liquid, or groups of density waves. 

A similar conception has been derived by several authors 
(Tisza, Mendelssohn, Landau) from the experiments; they speak 
of the liquid being a mixture of ordinary atoms and special 
degenerate atoms ( 2 J-particles) which are in the lowest quantum 
state and carry neither energy nor entropy. Yet in a liquid one 
cannot attribute a quantum state to single atoms. 

These considerations are also the clue to the understanding 
of other 'anomalous phenomena, as the flow through narrow 
capillaries or shts, the so-called fountain effect, the "second 
sound’, etc. Green has studied the properties of He II in detail 
and arrived at the conclusion that the quantum theory of liquids 
can account for the strange behaviour of this substance. 

I have dwelt on this special problem in some detail as it reveals 
in a striki n g way that quantum phenomena are not confined to 
atomic physics or microphysics where one aims at observing 
single particles, but appear also in molar physics which deals 
with matter in bulk. From the fundamental standpoint this 
distinction, so essential in classical physics, loses much of its 
meaning in quantum theory. The ultimate laws are statistical, 
and the deterministic form of the molar equations holds for 
certain averages which for large numbers of particles or quanta 
are all one wants to know. 

Now these molar laws satisfy all postulates of classical 
causality: they are deterministic and conform to the principles 
of contiguity and antecedence. 

With this statement the circle of our considerations about 
cause and chance in physics is closed. We have seen how classical 
physics struggled in vain to reconcile growing quantitative 
observations with preconceived ideas on causality, derived from 
everyday experience but raised to the level of metaphysical 
postulates, and how it fought a losing battle against the intrusion 
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The statistical interpretation which. I have presented in the last 
section is now generally accepted by physicists all over the world, 
with a few exceptions, amongst them a most remarkable one. 
As I have mentioned before, Einstein does not accept it, but still 
believes in and works on a return to a deterministic theory.. To 
illustrate his opinion, let me quote passages from two letters. 
The first is dated 7 November 1944, and contains these lines: 

‘In imserer wissenschaftlichen Erwartung haben wir uns zu Antipoden 
entwickelt. Eu glaubst an den wiirfelnden Gott nnd ich an voile Gesetz- 
lichkeit in einer Welt von etwas objektiv Seiendem, das ich anf wild 
speknjativem Weg zu erhaschen suche. Ich hoffe, dass einer einen mehr 
realistischen Weg, bezw. eine mehr greifbare Unterlage fiir eine solche 
Auffassimg finden wird, als es mir gegeben ist. Der grosse anfangliche 
Erfolg der Quantentheorie kann mich doch nicht zum Glauben an das 
fundamentale Wiirfelspiel bringen. ’ 

(In our scientific expectations we have progressed towards antipodes. 
You believe in the dice-playing god, and I in the perfect rule of law in a 
world of something objectively existing which I try to catch in a wildly 
speculative way. I hope that somebody will find a more realistic way, 
or a more tangible foundation for such a conception than that which is 
given to me. The great initial success of quantum theory cannot convert 
me to believe in that fundamental game of dice.) 

The second letter, which arrived just when I was writing these 
pages (dated 3 December 1947), contains this passage: 

‘Meine physikalische Haltung kann ich Dir nicht so begriinden, dass 
Du sie irgendwie vemiinftig finden wiirdest. Ich sehe natiirlich ein, dass 
die principiell statistische Behandlungsweise, deren Notwendigkeit im 
Rahmen des bestehenden Formalismus ja zuerst von Dir klar erkannt 
wurde, einen bedeutenden Wahrheitsgehalt hat. Ich kann aber deshalb 
nicht emsthaft daran glauben, weil die Theorie mit dem Grundsatz 
unvereinbar ist, dass die Physik eine Wirklichkeit in Zeit rmd Raum 
darstellen soli, ohne spukhafte Femwirkungen. . . . Davon bin ich fest 
uberzeugt, dass man schliesslich bei einer Theorie landen wird, deren 
gesetzmassig verbundene Dinge nicht Wahrscheinlichkeiten, sondern 
gedachte Tatbestande sind, wie man es bis vor kurzem als selbstver- 
standlich betrachtet hat. Zxrr Begriinduag dieser tlberzeugxmg kann 
ich aber nicht logische Griinde, sondern nur meinen kleinen Finger als 
Zeugen beibringen, also keine Autoritat, die ausserhalb meiner Haut 
irgendwelchen Respekt einflossen kann.’ 
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(I cannot substantiate my attitude to physics in such a manner that 
yotx would jSnd it in any way rational. I see of course that the statistical 
interpretation (the necessity of which in the frame of the existing for¬ 
malism has been first clearly recognized by yourself) has a considerable 
content of truth. Yet I cannot seriously believe it because the theory 
is inconsistent with the principle that physics has to represent a reality 
in space and time without phantom actions over distances. ... I am 
absolutely convinced that one will eventually arrive at a theory in which 
tbe objects connected by laws are not probabilities, but conceived facts, 
as one took for granted only a short time ago. However, I cannot provide 
logical arguments for my conviction, but can only call on my little finger 
as a witness, which cannot claim any authority to be respected outside 
my own skin.) 

I have quoted these letters because I think that the opinion 
of the greatest living physicist, who has done more than anybody 
else to establish modern ideas, must not be by-passed. Einstein 
d-oes not share the opinion held by most of us that there is over¬ 
whelming evidence for quantum mechanics. Yet he concedes 
^initial success’ and ‘a considerable degree of truth’. He ob¬ 
viously agrees that we have at present nothing better, but he 
hopes that this will be achieved later, for he rejects the ‘dice¬ 
playing god’. I have discussed the chances of a return to deter¬ 
minism and found them slight. I have tried to show that classical 
physics is involved in no less formidable conceptional difficulties 
and had eventually to incorporate chance in its system. We 
mortals have to play dice anyhow if we wish to deal with atomic 
systems. Einstein’s principle of the existence of an objective 
real world is therefore rather academic. On the other hand, his 
contention that quantum theory has given up this principle is 
not justified, if the conception of reality is properly understood. 
Of this I shall say more presently. 

Einstein’s letters teach us impressively the fact that even an 
exact science like physics is based on fundamental beliefs. The 
words ich glaube appear repeatedly, and once they are under¬ 
lined. I shall not further discuss the difference between Ein¬ 
stein’s principles and those which I have tried to extract from 
the history of physics up to the present day. But I wish to 
collect some of the fundamental assumptions which cannot be 
further reduced but have to be accepted by an act of faith. 
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Causality is such a principle, if it is defined as the belief in the 
existence of mutual physical dependence of observable situa¬ 
tions. However, all specifications of this dependence in regard 
to space and time (contiguity, antecedence) and to the infinite 
sharpness of observation (determinism) seem to me not funda¬ 
mental, but consequences of the actual empirical laws. 

Another metaphysical principle is incorporated in the notion 
of probability. It is the belief that the predictions of statistical 
calculations are more than an exercise of the brain, that they can 
be trusted in the real world. This holds just as well for ordinary 
probability as for the more refined mixture of probability and 
mechanics formulated by quantum theory. 

The two metaphysical conceptions of causality and probability 
have been our main theme. Others, concerning logic, arithmetic, 
space, and time, are quite beyond the frame of these lectures. 
But let me add a few more which have occasionally occurred, 
though I am sure that my list will be quite incomplete. One is 
the belief in harmony in nature, which is something distinct from 
causality, as it can be circumscribed by words like beauty, 
elegance, simplicity applied to certain formulations of natural 
laws. This behef has played a considerable part in the develop¬ 
ment of theoretical physics—^remember Maxwell’s equations of 
the electromagnetic field, or Einstein’s relativity—but how far 
it is a real guide in the search of the unknown or just the expres¬ 
sion of our satisfaction to have discovered a significant relation, 
I do not venture to say. Eor I have on occasion made the sad 
discovery that a theory which seemed to me very lovely neverthe¬ 
less did not work. And in regard to simplicity, opinions will 
differ in many cases. Is Einstein’s law of gravitation simpler 
than Newton’s? Trained mathematicians wiU answer Yes, 
meaning the logical simplicity of the foundations, while others 
win say emphatically No, because of the horrible complication 
of the formalism. However this may be, this kind of belief may 
help some specially gifted men in their research; for the validity 
of the result it has little importance (see Appendix, 35). 

The last belief I wish to discuss may be called the principle of 
objectivity. It provides a criterion to distinguish subjective 
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impressions from objective facts, namely by substituting for 
given sense-data others which can be checked by other indivi¬ 
duals. I have spoken about this method when I had to define 
temperature: the subjective feeling of hot and cold is replaced 
by the reading of a thermometer, which can be done by any 
person without a sensation of hot or cold. It is perhaps the most 
important rule of the code of natural science of which innumer¬ 
able examples can be given, and it is obviously closely related 
to the conception of scientific reality. For if reality is understood 
to mean the sum of observational invariants—and I cannot see 
any other reasonable interpretation of this word in physics—the 
elimination of sense qualities is a necessary step to discover 
them. 

Here I must refer to the previous Waynflete Lectures given by 
Professor E. D. Adrian, on The Physical Background of Percep¬ 
tion, because the results of physiological investigations seem to 
me in perfect agreement with my suggestion about the meaning 
of reality in physics. The messages which the brain receives have 
not the least similarity with the stimuli. They consist in pulses 
of given intensities and frequencies, characteristic for the trans¬ 
mitting nerve-fibre, which ends at a definite place of the cortex. 
All the brain ‘learns’ (I use here the objectionable language of 
the ‘disquieting figiire of a httle hobgobfin sitting up aloft in the 
cerebral hemisphere’) is a distribution or ‘map’ of pulses. From 
this information it produces the image of the world by a process 
which can metaphorically be called a consummate piece of com¬ 
binatorial mathematics: it sorts out of the maze of indifferent 
and varying signals invariant shapes and relations which form 
the world of ordinary experience. 

This unconscious process breaks down for scientific ultra¬ 
experience, obtained by magnifying instruments. But then it is 
continued in the full light of consciousness, by mathematical 
reasoning. The result is the reality offered by theoretical physics. 

The principle of objectivity can, I think, be api^lied to every 
human experience, but is often quite out of place. For instance: 
what is a fugue by Bach ? Is it the invariant cross-section, or the 
common content of all printed or written copies, gramophone 
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records, soundwaves at performances, etc., of this piece of music ? 
As a lover of music I say No! that is not what I mean by a fugue. 

It is something of another sphere where other notions U'Pplyj 
and the essence of it is not "notions’ at all, but the immediate 
impact on my soul of its beauty and greatness. 

In oases like this, the idea of scientific objective reality is 

obviously inadequate, almost absurd. 

This is trivial, but I have to refer to it if I have to make good 
my promise to discuss the bearing of modern physical thought 
on philosophical problems, in particular on the problem of free 
will. Since ancient times philosophers have been worried how 
free will can be reconciled with causality, and after the tremen¬ 
dous success of Newton’s deterministic theory of nature, this 
problem seemed to be stiU more acute. Therefore, the advent of 
indeterroinistic quantum theory was welcomed as opening a 
possibility for the autonomy of the mind without a clash with 
the laws of nature. Free 'will is primarily a subjective pheno¬ 
menon, the interpretation of a sensation which we experience, 
similar to a sense impression. We can and do, of course, project 
it into the mind a of om* fellow beings just as we do in the case 
of music. We can also correlate it with other phenomena in order 
to transform it into an objective relation, as the moralists, 
sociologists, lawyers do—^but then it resembles the original 
sensation no more than an intensity curve in a spectral diagram 
resembles a colour which I see. After this transformation into 
a sociological concept, free will is a symbolic expression to 
describe the fact that the actions and reactions of human beings 
are conditioned by their internal mental structure and depend on 
their whole and unaccountable history. Whether we beheve 
theoretically in strict determinism or not, we can make no use 
of this theory since a human being is too complicated, and we 
have to be content with a working hypothesis like that of spon¬ 
taneity of decision and responsibility of action. If you feel that 
this clashes •with determinism, you have now at your disposal 
the modem indeterroinistic philosophy of nature, you can assume 
a certaiu ‘freedom’, i.e. de-viation from the deterministic laws, 
because these are only apparent and refer to averages. Yet if 
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you believe m perfect freedom you will get into difficulties again, 
because you cannot neglect tbe laws of statistics which are laws 
of nature. 

I think that the philosophical treatment of the problem of free 
will suffers often (see Appendix, 36) from an insufficient dis¬ 
tinction between the subjective and objective aspect. It is 
doubtless more difficult to keep these apart in the case of such 
sensations as free will, than in the case of colours, sounds, or 
temperatures. But the application of scientific conceptions to a 
subjective experience is an inadequate procedure in all such 
cases. 

You may call this an evasion of the problem, by means of 
dividing all experience into two categories, instead of trying to 
form one all-embracing picture of the world. This division is 
indeed what I suggest and consider to be unavoidable. If quan¬ 
tum theory has any philosophical importance at all, it lies in the 
fact that it demonstrates for a single, sharply defined science the 
necessity of dual aspects and complementary considerations. Niels 
Bohr has discussed this question with respect to many applica¬ 
tions in physiology, psychology, and philosophy in general. 
According to the rule of indeterminacy, you cannot measure 
simultaneously position and velocity of particles, but you have 
to make your choice. The situation is similar if you wish, for 
instance, to determine the physico-chemical processes in the 
brain connected with a mental process: it cannot be done because 
the latter would be decidedly disturbed by the physical investiga¬ 
tion. Complete knowledge of the physical situation is only 
obtainable by a dissection which would mean the death of the 
living organ or the whole creature, the destruction of the mental 
situation. This example may suffice; you can find more and 
subtler ones in Bohr’s writings. They illustrate the limits of 
human understanding and direct the attention to the question 
of fixing the boundary line, as physics has done in a narrow 
field by discovering the quantum constant Ti. Much futile 
controversy could be avoided in this way. To show this by a 
final example, I wish to refer to these lectures themselves which 
deal only with one aspect of science, the theoretical one. There 
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is a powerful school of eminent scientists who consider such 
things to be a futile and snobbish sport, and the people who 
spend their time on it drones. Science has undoubtedly two 
aspects: it can be regarded from the social standpoint as a prac¬ 
tical collective endeavour for the improvement of human 
conditions, but it can also be regarded from the individualistic 
standpoint, as a pursuit of mental desires, the hunger for know¬ 
ledge and xmderstanding, a sister of art, philosophy, and religion. 
Both aspects are justified, necessary, and complementary- The 
collective enterprise of practical science consists in the end of 
individuals and cannot thrive without their devotion. But 
devotion does not suffice; nothing great can be achieved without 
the elementary curiosity of the philosopher. A proper balance is 
needed. I have chosen the way which seemed to me to harmonize 
best with the spirit of this ancient place of learning. 
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1 . (II. p. 8.) Multiple causes 

Any event may have several causes. This possibility is not 
excluded by my definition (given explicitly on p. 9), though I 
speak there of A being ‘the’ cause of the effect B. Actually the 
‘number’ of causes, i.e. of conditions on which an effect B 
depends, seems to me a rather meaningless notion. One often 
finds the idea of a ‘causal chain’ where B depends 

directly on A^,A^ on A 2 , etc., so that B depends indirectly on 
any of the A^. As the series may never end—where is a ‘first 
cause’ to be found ?—the number of causes may be, and will be 
in general, infinite. But there seems to be not the slightest 
reason to assume only one such chain, or even a number of 
chains; for the causes may be interlocked in a comphcated way, 
and a ‘network’ of causes (even in a multi-dimensional space) 
seems to be a more appropriate picture. Yet why should it be 
enumerable at all? The ‘set of all causes’ of an event seems to 
me a notion just as dangerous as the notions which lead to logical 
paradoxes of the type discovered by Bussell. It is a metaphysical 
idea which has produced much futile controversy. Therefore I 
have tried to formulate my definition in such a way that this 
question can be completely avoided. 

2. (III. p. 13.) Derivation of Newton’s law from Kepler’s 

laws 

The fact that Newton’s law is a logical consequence of Kepler’s 
laws is the basis on which my whole conception of causality in 
physics rests. For it is, apart from Galileo’s simple demonstra¬ 
tion the first and foremost example of a timeless cause-effect 
relation derived from observations. In most text-books of 
mechanics the opposite way (deduction of Kepler’s laws from 
Newton’s) is followed. Therefore it may be useful to give the 
full proof in modern terms. 

We begin with formulating Kepler’s laws, splitting the first 
one in two parts: 

la. The orbit of a planet is a plane curve. 

16. It has the shape of an ellipse, one focus of which is the sun. 
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II. The area A swept by the radius vector increases propor¬ 
tionally to time. 

III. The ratio of the cube of the semi-axis a of the elhpse to 
the square of the period T is the same for all planets. 

From la it follows that it suffices to consider a plane, intro¬ 
ducing rectangular coordinates x,y, polar coordinates so 

X — r Gos tj>, y — rsin^. 

Indicating differentiation with respect to time by a dot, one 
obtains for the velocity 

X == rcos< 5 &~r<;&sin^, y = r sin^-f cos<^, 


and for the acceleration 

X = UyCos^—a^sin</>, y == a^Bm<f>-{-a^GOB4^, 

where (1) 

(^) 

are the radial and tangential components of the acceleration. 

Next we use II. The element of the area in polar coordinates 
is obviously 



If the origin is taken at the centre of the sun, the rate of in¬ 
crease of A is constant, say dA = \hdt, or 

2A = r'^<j> = h. (3) 

Now it is convenient to use the variable 


u = 


1 


r 
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instead of r and to describe the orbit by expressing as a function 
of (ft, u{<f>). Then ^ 

^ (4) 


4 = ^inu^).4 


2h^u^ 


Further, 


dr • 


1 du ' 


du 
-L du 

~h 


( 5 ) 

(6) 


u^d<f>^ " d<j>‘ 

Substituting (4), (5), (6) into (2), one finds 

== 0 . 

Hence the acceleration has only a radial component with 
respect to the sun. To obtain the value of we calculate, with 
the help of (4), 


dr - 

d4^ 


Ji dr _ 


r^ d(j> 


d 

d<l> 


( 


’‘mY 




d^u 


dcl>^' 


and substitute this in (1): 




■h^u^ 


Id^u , \ 


( 7 ) 


( 8 ) 


Now we use lb. The polar equation of an ellipse is 


r 


<1 


( 9 ) 


l-f-ecos^^’ 

where q is the semi-latus rectum and e the numerical eccentricity; 

1 


or 


u 


(l-j-eCOS^S). 


From this, one obtains 
du € 


d4> 

hence from (8) 


<1 


sin</>. 


d^u 

dp 


-COS<i, 

<1 




^ 2 = 
q q 


( 10 ) 


The acceleration is directed to the sun (centripetal) and is 
inversely proportional to the square of the distance. 
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According to the third law III one can write 


_ /X 

5^2 — 


(11) 


wliere the constant (x is the same for all planets. 

Now integrating (3) for a full revolution one has 

2A = hT. (12) 

On the other hand, the area of an ellipse is given by 

A ~ 'irdby ( 13 ) 

where a and b are the major and minor semi-axes. 

Taking in (9) <(> = 0 and <l> = rr one gets the aphelion and 
perihelion distances; half of the sum of those is the semi-major 

2\l + «^l —£/ 1— 

while the semi-minor axis is given by 

6 - ../a-.-) - 

hence aq — b^. 

Substituting this in (13), one gets from (12) 

(M’-(¥)■->■= 

solving with respect to h'^jq and using (11): 

y = 477-2 fia = (14) 

Therefore the law of acceleration (10) becomes 


a. 



(15) 


where jx is the same for all planets, hence a property of the sun, 
called the gravitational mass. 

This demonstrates the statement of the text that Newton’s 
derivation of his law of force is purely deductive, based on the 
inductive work of Tycho Brahe and Kepler. The new feature due 
toNewton is the theoretical interpretation of the deduced formula 
for the acceleration, as representing the ^cause’ of the motion. 
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or the force determining the motion, which then led him to the 
fundamental idea of general gravitation (each body attracts each 
other one). In the text-books this situation is not always clear; 
this may be due to Newton’s own representation in his Principia 
where he uses only geometrical constructions in the classical 
style of the Greeks, Yet it is known that he possessed the 
methods of infinitesimal calculus (theory of fluxions) for many 
years. I do not know whether he actually discovered his results 
with the help of the calculus; it seems to me incredible that he 
should not. He was obviously keen to avoid new mathematical 
methods in order to comply with the taste of his contemporaries. 
But it is known also that he liked to conceal his real ideas by 
dressing them up. This tendency is found in Gauss and other 
great mathematicians as well and has survived to our time, 
much to the disadvantage of science. 

Newton regarded the calculation of terrestrial gravity from 
astronomical data as the crucial test of his theory, and he with¬ 
held publication for years as the available data about the radius 
of the earth were not satisfactory. The formula (3.3) of the text 
is simply obtained by regarding the earth as central body and 
the moon as ‘planet’. Then p, is the gravitational mass of the 
earth which can be obtained from (11) by inserting for a the 
mean distance R of the centre of the moon from that of the earth, 
and for T the length of the month. Substituting fx — 4:7r^R^IT^ 
into (15), where r is the radius of the earth, one obtains for the 
acceleration on the earth’s surface g ( = —a,.) the formula (3.3) 


of the text. 


9 • 


(16) 


If here the values RJr — 60, R= 3*84x10^® cm., and 
T — 27^ 7^ 43^ 11*5® = 2*361 X 10® sec. are substituted, one 
finds g — 980 *2 cm. sec.“2, while the observed value (extrapolated 
to the pole) is gr == 980*6 cm. sec.~® 

This reasoning is based on the plausible assumption that the 
acceleration produced by a material sphere at a point outside is 
independent of the radial distribution of density and the mass of 
the sphere can therefore be regarded as concentrated in the 
centre. The rigorous proof of this lemma forms an important 
part of Newton’s considerations and was presumably achieved 
with the help of his theory of jfiuxions. 
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3. (IV. p. 20.) Cauchy’s mechanics of continuous media 

The mathematical tool for handling continuous substances is 
the following theorem of Gauss (also attributed to Green). 

If a vector field A is defined inside and on the surface S of a, 
volume F, one has 

div A dV == J A.n dS, (1) 

s 

where n is the unit vector in the direction of the outer normal of 
the surface element dS and 



divA = 


dA„ 8A„ d . 


( 2 ) 


If p is the density, the total mass inside V is 

m — j p dV. (3) 

The amount of mass leaving the volume through the surface is 


J u. n dS, 

s 

where u = pv is the current, v the velocity. 


The indestructibility of mass is then expressed by 


J u. n d/S = 0. 


Substituting (3) and applying (1), one obtains a volume 

integral, which vanishes for any surface; hence its integrand 

must be zero: . , 

p-hdivu = 0. (4) 


This is the continuity equation (4.5) of the text. 

Consider now the forces acting on the volume V. INTeglecting 
those forces which act on each volume element (like Newton’s 
gravitation), we assume with Cauchy that there are surface 
forces or tensions, acting on each element dS of the surface S, 
and proportional to dS. They will also depend on the orientation 
of dS^ i.e. on the normal vector n, and can therefore be written 
dS. If n coincides with one of the three axes of coordinates 
oc,y,z, the corresponding forces per unit area may be repre¬ 
sented by the vectors Tjp, T^. Now the projections of an 
element dS on the coordinate planes are 

= n^dS, dSy = UydS, dS^ = n^dS. 
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The equilibrium of the tetrahedron with the sides dS, dS^, dSy^ dS^ 
then leads to the equation 

T^dS = T^dS^-^Tyd8y+T,dS,, 

or = T ^n^-^TyTiy-^-T gTig, (5) 

which is the formula (4.6) of the text. 


;r 

Fig. 4. 

Consider further the equilibrium of a rectangular volume 
element, and in particular its cross-section 2 — 0, with the sides 
dx, dy. The components of T^. in this plane may be denoted by 
and T^y, those of T^, by Tyy. Then the tangential com- 



Fio. 5. 



ponents on the surfaces dydz and dxdz produce a couple about the 
origin O with the moment 

{T^y dydz)dx — {Ty^ dxdz)dy. 

This must vanish in equilibrium; therefore one has 

T = T 

•*-xy 

and the corresponding equations obtained by cyclic permutation 
of the indices, (4.7) of the text. Hence the stress tensor T defined 
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(4-8) is symmetrical. One can express this, with the help of 
(5), in the form 

C^n)x — ^xx'^x~^^yx'^y~^^zx'^s ~ T^.H, 

Where is the vector (T,,, T^^). 

The ic-component of the total force F = J dS can now be 
transformed with the help of formula (1) into a volume integral 

= f = f T^.ndS = f divT^dV. 

-S' F 

Using the tensor notation of the text, (4.10), one can write this 

F=JdivTti:F. (6) 

This has to be equated to the rate of change of momentum of a 
given amount of matter, i.e. enclosed in a volume moving in 
time. One has for any function of space 

V«+A<) V(t) ^ 

j j ■ 

V(#) AV ^ 

The second mtegral is extended over the volume between two 
infimtesimally near positions of the surface, so that 

dV = n.vAtdS 

and therefore 


Hence 


J''I>dF = J<I>n.vdSAi= f div(<l>v) dV At. 

Vs 

^j0dv = j |^+(iiv(<i>v)| dr. 


(7) 


components of the momentum density 
P e obtams for the rate of change of the total momentum 

dt ^ f = f l^^§p + '^^(P'^)j dV 

^ J ['’{S+ + div{pv)j] dr. 
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Here the second integral vanishes in consequence of the continu¬ 
ity equation (4), with u = pv. In the first integral appears the 
convective derivative, defined by (4.11) of the text, 

dv Sv , / 5 \ 

3r = -^+ V- ^ V. 


Hence 


p dV, 
^ dt 


Now the equation of motion 


reduces in virtue of (6) and (8) to (4.9) of the text: 

d-v __ 

p —- = div T. 


Consider in particular an elastic fluid where 

T ~ T z= T = _ 'D T — T — T — O 

‘^xx vv ^ zz Jr'> '^yz -^zx ’^xy — 

and the pressure ^ is a function of p alone. Then the continuity 
equation and the equations of motion 

|^ + div(pv) = 0, 




v}-}- 


djp dp 
dp 0x 


are four differential equations for the four functions p, Vy, v^. 

If one wishes to determine small deviations from equilibrium, 
then v and <f> — p — p^ are small and pq constant with regard to 
space and time. Then the last two equations reduce in first 
approximation to , 

^+Podivv= 0, 

8t \dp)f,dx. 

By differentiating the first of these with respect to time and 
dv 

substituting po-—from the second, one finds 

ot 


U/>/o W) 


= 0 , 
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or with 

diw J?_A 

5x * ax ^ 


and 

(1).='■ 

(10) 


1 8^ 
c2 et^ ~~ 

(11) 


f ^“riation of 

equation wWh^ 'P^Iooity components satisfies the same 

q ion, which IS the prototype of aU laws of wave propagation. 

^ ^ Maxwell s equations of the electromagnetic 

deS^B“th^!xn®f"“^^ part Of MaxweU’s work consisted in con- 

oSfreSreo^^- in the text, in a set 

ot diiierential equations which, with the usual notation, are 

divD = 47rp, curlH =^u 


divB = 0, 


curlH 

c 

curlE + is = 0, 


( 1 ) 


D = eE, B = 

field example. Coulomb’s law for the electrostatic 

remaSs ^ ^ B == 0, H == o, u = 0; then there 

divD = 4:Trp^ curlE = 0. 

The second equation implies that there is a potential 4 ,, such that 

E = 

ax' 

equatio^'*'’’ D = E, one obtains therefore Poisson’s 

—divE = A4= —irrp. 


The solution is 


<l> 




( 2 ) 


CouTottsti^'fcrt' -P—« 

B.O. s„« a.. «s "ST^z " f 
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Maxwell’s physical idea consisted in. discovering the asym¬ 
metry in the equations (1) which, in our style of writing, is 

obvious even to the untrained eye: the missing term in the 

c 

second equation. The logical necessity of this term follows from 
the fact of the existence of open currents, e.g. discharges of con¬ 
densers through wires. In this case the charge on the condenser 
changes in time, hence p ^ 0; on the other hand, the equations 

(1) imply divu = — divcurlH = 0. Therefore the continuity 

4:77 

equation p+<iivu = 0 is violated. 

To amend this Maxwell postulated a new type of current 
bridging the gap between the conductors in the condenser, with 
a certain density w, so that 

4:77" 

curlH == —(u-f-w). (3) 

c 

Then taking the div operation one has 

div w = —div u == p = — divD. 

Itt 

The simplest way of satisfying this equation is putting 

w = Ad, (4) 

SO that the corresponding equation in Maxwell’s set becomes 

curlH —il> = —u (5) 

c c ^ ' 

and complete symmetry between electric and magnetic quanti¬ 
ties is obtained (apart from the fact that the latter have no true 
charge and current). 

The modified system of field equations permits the prediction 
of waves with finite velocity. In an isotropic substance free of 
charges and currents (p = 0, u = 0, D = cE, B = p,H) one has 

curlH —iE = 0, curlE + ^i^H = 0; 
c c 

taking the curl of one of them, and using the formula that for a 
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vector with vanishing div one has curl curl = —A, one obtains 
for each component of E and H the wave equation 



For vacuum (e = /x = 1) the velocity of propagation should 
therefore be equal to the electromagnetic constant c. As stated 
in the text, this constant has the dimensions of a velocity and can 
be measured by determining the magnetic field of a current 
produced by a condenser discharge (measured therefore electro¬ 
statically). Such experiments had been performed by Kohl- 
rausch and Weber, and their result for c agreed with the velocity 
of light in vacuo. This evidence for the electromagnetic theory of 
light was strongly enhanced by experiments carried out by 
Boltzmann, which showed that the velocity of light in simple 
substances (rare gases, which are monatomic) can be calculated 
from their dielectric constant e being practically = 1) with the 
help of Maxwell’s formula = c/Ve. 

Maxwell’s formulation satisfies contiguity, but its relation to 
Cauchy’s form of the dynamical laws has still to be established. 
The electric and magnetic field vectors, though originally defined 
by the forces on point charges and magnetic poles (which act ually 
do not exist), are defined by the equations also in places where 
neither charges nor currents exist. Yet they are not stresses 
themselves; they are analogous to strains, on which the stresses 
depend. The law of this connexion has also been found by Max¬ 
well; it is a mathematical formulation of Faraday’s intuitive 
interpretation of the mechanical reactions between electrified 
and magnetized bodies. A short indication must here suffice. 

Apart &om the electric force on a point charge e, F = cE, 
there exists a mechanical force on the element of a linear current 
u, produced by a magnetic field H; this force is perpendicular 
to H and to the current u and therefore does no work. It is not 
quite umquely determined, as one can obviously add any force 

whose line integral over a closed circuit vanishes. The simplest 
expression is: 

F = iuAB, 
c 


T considering the change of magnetic energy 

^ produced by a virtual displacement of an element 
oi the current. 
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To illustrate Maxwell’s procedure it suffices to consider charge 
distributions in vacuo with density p and current u = pv. 

\ 

Combining the two forces F = eE and F = ~ u A B into one 

c 

expression, one has for the density of force 


f = p|E + i(vAH)|, 


the so-called Lorentz force. 

Substituting here for p and pv the expressions from Maxwell’s 
equations one can, by elementary transformations, bring f into 
the form of Cauchy, , -r 


where 


divT, 




(E^E,+H„H,), 


These are the celebrated formulae of Maxwell’s tensions. They 
can be easily generalized for material bodies with dielectric 
constant and permeability, and they have become the prototype 
for similar expressions in other field theories, e.g. gravitation 
(Einstein), electronic field (Dirac), meson field (Yukawa). 


5. (IV. p. 27.) Relativity 

It is impossible to give a short sketch of the theory of relativity, 
and the reader is referred to the text-books. The best representa¬ 
tion seems to me still the article in vol. v of the Mathematical 
Encyclopaedia written by W. Pauli when he was a student, 
about twenty years of age. There one finds a clear statement of 
the experimental facts which led to the mathematical theory 
almost unambiguously. Eddington’s treatment gives the im- 
pression that the results could have been obtained—or even 
have been obtained—by pure reason, using epistemological 
principles. I need not say that this is wrong and misleading. 
There was, of course, a philosophical urge behind Einstein’s 
relentless effort; in particular the violation of contiguity in 
Newton’s theory seemed to him unacceptable. Yet the greatness 
of his achievement was just that he based his own theory not on 
preconceived notions but on hard facts, facts which were obvious 
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to everybody, but noticed by nobody. The main fact was the 
identity of inertial and gravitational mass, which he expresse 
as tlie principle of equivalence between acceleration and gravi a 
tion. An observer in a closed box cannot decide by any experi¬ 
ment whether an observed acceleration of a body in the bc^ is 
due to gravity produced by external bodies or to an acceleration 
of the box in the opposite direction. This principle means tha 
arbitrary, non-linear transformations of time must be admitted. 
But the formal symmetry between space-coordinates and time 
discovered by Minkowski made it very improbable that the 
transformations of space should be linear, and this was corin 
borated by considering rotating bodies: a volume element on the 
periphery should undergo a peripheral contraction according 
to the results of special relativity, but remain unchanged in the 
radial direction. Hence acceleration was necessarily connected 
with deformation. This led to the postulate that all laws of 
nature ought to be unchanged (covariant) with respect to 
arbitrary space-time transformations. But as special relativity 
must be preserved in small domains, the postulate of invariance 
of the line element had to be made. 

The long struggle of Einstein to find the general co variant field 
equations was due to the difficulty for a physicist to assimilate 
the mathematical ideas necessary, ideas which were in fact 
completely worked out by Riemann and his successors, Levi— 
Civita, Ricci, and others. 

I wish to add here only one remark. The physical significance 
of the line element seems to me rather mystical in a genuinely 
continuous space-time. If it is replaced by the assumption of 
parallel displacement (affine connexion), this impression of 
mystery is still further enhanced. On the other hand, the 
appearance of a> finite length in the ultimate equations of physics 
can be expected. Quantum theory is the first step in this direc¬ 
tion; it introduces not a universal length but a constant, Planck’s 
of the dimension length times momentum into the laws of 
physics. There are numerous indications that the further 
development of physics will lead to a separate appearance of 
these two factors, ^ in the ultimate laws. The difficulties 

of present-day physics are centred about the problem of intro¬ 
ducing this length g in a way which satisfies the principle of 
relativity. This fact seems to indicate that relativity itself 
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needs a generalization where the infinitesimal element ds is 
replaced by a finite length. 

The papers quoted in the text are: A. Einstein, L. Infeld, and 
B. Hoffmann, Ann. of Math., 39, no. 1, p. 65 (Princeton, 
1938); V. A.Eock, Jowrn. of Phys. U.S.S.E. l,no. 2,p. 81 (1939). 

6. (V. p. 38.) On classical and modern thermodynamics 

It is often said that the classical derivation of the second law 
of thermodynamics is much simpler than Caratheodory’s as it 
needs less abstract conceptions than PfafiSan equations. But 
this objection is quite wrong. For what one has to show is the 
existence of an integrating denominator of dQ. This is trivial 
for a Pfaffian of two variables (representing, for example, a single 
fluid with ¥,■&); it must be shown not to be trivial and even, in 
general, wrong for Pfaffians with more than two variables (e.g. two 
fluids in thermal contact with V^,d'). Otherwise, the student 
cannot possibly understand what the fuss is all about. But that 
means explaining to him the difference between the two classes 
of Pfaffians of three variables, the integrable ones and the non- 
integrable ones. Without that all talk about Carnot cycles is just 
empty verbiage. But as soon as one has this difference, why not 
then use the simple criterion of accessibility from neighbouring 
points, instead of invoking quite new ideas borrowed from 
engineering ? I think a satisfactory lecture or text-book should 
bring this classical reasoning as a corollary of historical interest, 
as I have suggested long ago in a series of papers {Phys. Zeitschr. 
22, pp. 218, 249, 282 (1921)). 

Since writing the text I have come across one book which 
gives a short account of Carath6odory’s theory, H. Margenau 
and G. M. Murphy, The Mathematics of Physics and Chemistry 
(D. van Hostrand Co., New York, 1943), § 1.15, p. 26. But 
though the mathematics is correct, it does not do justice to the 
idea. For it says on p. 28: ‘This formal mathematical con¬ 
sequence of the properties of the Pfaff equation [namely the 
theorem proved in the next section of the appendix] is known as 
the principle of Caratheodory. It is exactly what we need for 
thermodynamics.’ Caratheodory’s principle is, of course, not 
that formal mathematical theorem but the induction from obser¬ 
vation that there are inaccessible states in any neighbourhood, 
of a given state. 
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7. (V. p. 39.) Theorem of accessibility 

An example of a Pfaffian whicli has no integrating denomina- 
tor (by the way, the same example as described in geometrical 
terms in the text) is this: 


dQ = —y dx-\-x dy~\-h dz, 

where & is a constant. If it were possible to write dQ in the 
form Xd<j>, where A and <}> are functions of x, y, z, one would have 


H _ _y. 


dx 


hence 


dy 


dydz dz\Xj 




__ X 

“A’ 

ay 

dzdx 


dz ~ A’ 


dzxXi 


dx\X) 


dxdy 


or 


I. 

dy 


(y\ _ 

\A/ dx\x)' 


dz dy 


■y 


dz 


dx 


2A 


X - Uy — . 

dx'^dy 


• substituting dX/dx and SX/dy from the first two equations 
m the third one finds ^ 

A = 0. 

Examples hke this show clearly that the existence of an 
mtegratmg denominator is an exception. 

give the proof of the theorem of accessibility. 
Consider the solutions of the Pfaffian 


dQ = X dx-\-T dy-\~Z dz — 0, 
which lie in a given surface S, 

X = x{u, v), y == 2 : = z{u, v). 

They satisfy a Pfaffian 

dQ == TJ du~\-V dv — 0, 

U 


( 1 ) 


du dy, dv, 


F = X 


dx 


dy 


dv 


dv 


Z 


dz 

dv' 


where 


(2) 
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Hence through every point JP of S there passes one curve, because 
(2) is equivalent to the ordinary differential equation 

dv ~ U’ 


which has a one-parameter set of solutions v) = const., 
covering the surface 

Let us now suppose that,inthe neighbourhood of apointP, there 
are inaccessible points; let Q be one of these. Construct through 




Pia. 7. 


P a straight line which is not a solution of (1), and the plane tt 
through Q and In tt there is just one curve satisfying (1) and 
going through Q\ this curve will meet the line oSf at a point It, 
Then P must be inaccessible from P; for if there should exist a 
solution leading from P to P, then one could also reach Q from 
P by a continuous (though Idnked) solution curve, which 
contradicts the assumption that Q is inaccessible from P. The 
point R can be made to lie as near to P as one wishes by choosing 
Q near enough to P. 

How we move the straight line JSf parallel to itself in a cycho 
way so that it describes a closed cylinder. Then there exists on 
this cylinder a solution curve ^ which starts from P on and 
meets again at a point N. It follows that N and P must coin¬ 
cide. For otherwise one could, by deforming the cylinder, make 
N sweep along the line cSf towards P and beyond P. Hence there 
would be an interval of accessible points (like N) around P, 

5131 T 
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while it has been proved before that there are inaccessible points 
Q in any neighbourhood of P. 

As JV now coincides with P the connecting curve ^ can be 
made, by steady deformation of the cylinder, to describe a 
surface which contains obviously all solutions starting from P. 
If this surface is given by z) — 0, one has 

dQ = A 

which is the theorem to be proved. 

The function ^ and the factor A are not uniquely determined; 
if ^ is replaced by one has 

Xd<f> = Ad<E) with A = A . 

dcj) 

8. (V. p. 43.) Thermodynamics of chemical equilibria 

Caratheodory’s original publication on his foundation of 
thermodynamics {Math, Ann. 61, p. 355, 1909) is written in a 
very abstract way. He considers a type of systems which are 
called simple and defined by the property that of the parameters 
necessary to fix a state of equilibrium all except one are con¬ 
figurational variables, i.e. such that their values can be arbi¬ 
trarily prescribed (like volumes). In my own presentation of the 
theory, of 1921 (quoted in Appendix, 6), there is only a hint at the 
end (§ 9) how such variables can be introduced in more complica¬ 
ted cases, as for instance for chemical equilibria where the concen¬ 
trations of the constituents can be changed. I hoped at that time 
that this might be worked out by the chemists themselves, for it 
needs nothing more than the usual method of semi-permeable 
walls with a shght modification of the wording. As this has not 
happened, I shall give here a short indication how to do it. 

I consider first a simple fluid (without decomposition), but 
arrange it in such a way that volume F and mass M are both 
independently changeable. For this purpose one has to imagine 
a cylinder with a piston attached to the volume F, connected 
by a valve, through which substance can be pressed into the 
volume F considered. The position of the auxiliary piston 
determines uniquely the mass M contained in F; hence M can 
be regarded as a configuration variable in Carath^odory’s sense. 
If the valve is closed, F can be changed, by moving the ‘main’ 
piston, without altering M. Hence M and F are both indepen- 
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dent configuration variables, and the work done for any change 
of them must be regarded as measurable. If this work is deter¬ 
mined adiabatically one obtains the energy function, say in 
terms of F, M and the empirical temperature -d-, ?7(F, 



Fxg, 8. 


When this is known the differential of heat is defined by the 
difference _ dU+pdV-ixdM, (1) 


where p and ft are functions of the state (F, M,d') like U, which 
can be regarded as empirically known. 

Now one has in (1) a Pfaffian of three variables and can apply 
the same considerations as before which lead to the result that dQ 
is integrable and can be equated to T dS. Hence one can write 


dU = TdS-pdV-^ixdM. (2) 

But U must be a homogeneous function of the first order in the 
variables 3, F, ikf. If one introduces the specific variables 

w.-s.'fby U == Mu, S = Ms, V = Mv, (3) 

one has according to Euler’s theorem 

u = Ts —jpi;-l-ft, (4) 

du du 


where 


T = 


V 


(5) 


ds dv 

and then, from (2), du = Tds — pdv. (6) 

If the substance inside F is a chemical compound and one wishes 
to investigate its decomposition into n components, one assumes 
n cylinders with pistons attached to F, separated from F by 
semi-permeable walls, each of which allows the passage of only 
one of the components. Then one has in the same way 


dV 


n 


TdS—pdV+ y (jLidM^ 

i—1 




(7) 
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t>e regarded as configuration variables 
and ?7, p, ^ 2 ,..., /x^ as known functions of these. Now, as above, 
the specific energy, entropy, and volume are introduced and 
further the concentrations by 

Mi = CiM, ( 8 ) 

where M is the total mass: JIT = J hence 

% 

n 

.2ci=l- (9) 

One obtains from Euler’s theorem 


U = Ts—pv+ 

i 


( 10 ) 


with 





( 11 ) 


where the differentiation with respect to the is performed as 
if they were independent; and 


- 7^ 

du = Tds—pdv-i- '^PidCi. ( 12 ) 

i=l 

^ The formalism of thermodynamics consists in deriving rela¬ 
tions between the variables by differentiating the equations 
(11), e.g. P^rp ^ ^ 

— = 
dv 


ds 


dv 


8c/ 


etc. 


As experiments are often performed at constant pressure or 
temperature or both, one uses instead of the 

functions free energy u-Ts =/, enthalpy u+pv = w, or free 

enthalpy — u-\-pv—Ts (defined by (4)). For the latter, for 
example, it follows from (12) that 

dfx = sdT-{-vdp-\- '^{XidCi, 


or 


dT’ 


V = 


dfjL 


Pi, = 


_ S/x 


Sc, 

so that for T = const., p = const., one has simply 

2 p, dc,. 

% 

The most important theorem, which follows from these general 


(13) 

(14) 

(16) 
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equations, is Gibbs’s phase rule. The system may exist in differ¬ 
ent phases if the n equations 

= Ci (i = 1 ,..., n) (16) 

have several solutions. Let m be the number of independent 
solutions; then there are m phases which can be ordered in such a 
way that each has contact with two others only. Hence there 
are m—l interfaces and n equations of the form (16) for each, 
altogether n(in —1) independent equations. On the other hand, 
there are (n —1) independent concentrations for each 

phase, i.e. m{n — 1) for the whole system, to which the two vari¬ 
ables p, T have to be added; hence m{n —l)-f-2 independent 
variables. The number of arbitrary parameters, or the number of 
degrees of freedom of the system, is therefore 

ni{n — 1)-|-2 — n{m —1) == n — m -|- 2 . 

So for a single pure substance, n = 1, the number of degrees of 
freedom is 3—m; hence there are three cases m — 1, 2, 3 corre¬ 
sponding to one phase, two or three coexisting phases; more than 
three phases cannot be in equilibrium. All further progress in 
thermodynamics is based on special assumptions about the func¬ 
tions involved, either prompted by experiment, or chosen by an 
argument of simplicity, or—and this is the most important 
step—derived from statistical considerations. 

9. (V. p. 44.) Velocity of sound in gases 

The simple problem of calculating the adiabatic law for an 
ideal gas gives me the opportunity to show how the theory of 
Carath6odory determines uniquely the absolute temperature 
and entropy. 

The ideal gas is defined by two properties: (1) Boyle’s law, 
the isotherms are given by pV — const.; (2) the same quantity 
pV remains constant if the gas expands without doing work. In 
mathematical symbols. 


pV 


u = C7(^). 


Hence dQ = 

dU-\-p dV 

dV 

= U'd^^F^^ . 

(1) 

If 6 is defined by 

log 0 = r 

U'{-&)dd^ 

(2) 


^ j 

FiJF} ’ 
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(1) can be written 

dQ = F{d')dlog{dV)i 

hence one can pnt A = F{&)y <j> = log(^F^) and obtain from the 
equation (5.25) of the text 


= 


Slog-F(T9’) 

__ 


Then (5.27) gives, writing C = 1/-R, the usixal form of the 
equation of state ( 3 , 


and S = S^+R log(/9F). 

If the special assumption is made, that U depends linearly on 
pV (which holds for dilute gases with the same approximation 
as Boyle’s law), one has U — c^T and, from (2), 


ioge = /“^ = |iogr. 

The entropy becomes therefore 

>8^ = ^o+log(T"*F^), (4) 

or, substituting p for T from (3), 

S = 8^-\-\og{p^vV^p), (5) 

w^ere = c^+R (6) 

is the specific heat for constant pressure. Hence the adiabatic 
law /S' = const, is equivalent to 


pF^ = const.. 


'V 


'V 


(7) 


which is identical with the equation p — apV in "the text, as the 
density p is reciprocal to the volume F. 

The velocity of sound was calculated in Appendix, 3; according 
to (3.10) it is 

c= /^. 
aJ dp 

F r the isothermal law, p = ap, this means 
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while, for the adiabatic law, jp = apy, one finds 



which is considerably larger; e.g. for diatomic molecules (air) 
experiment, and kinetic theory as well, give y = 2 — i . 4 ^ 

10. (V. p. 45.) Thermodynamics of irreversible processes 

Since I wrote this section of the text a new development of 
the descriptive or phenomenological theory has come to my 
knowledge which is remarkable enough to be mentioned. 

It started in 1931 with a paper by Onsager in which the 
attempt was made to build up a thermodynamics of irreversible 
processes by taking from the kinetic theory one single result, 
called the theorem of microscopic reversibihty, and to show that 
this suffices to obtain some important properties of the flow of 
heat, matter, and electricity. The starting-point is Einstein’s 
theory of fluctuations (see Appendix, 20), where the relation 
S — /clogP between probability P and entropy 8 is reversed, 
using the known dependence of 8 on observable quantities to 
determine the probability P of small deviations from equili¬ 
brium. Then it is assumed that the law for the decay of an 
accidental accumulation of some quantity (mass, energy, tem¬ 
perature, etc.) is the same as that for the flow of the same 
quantity under artificially produced macroscopic conditions. 
This, together with the reversibility theorem mentioned, 
determines the main features of the flow. The theory has been 
essentially improved by Casimir and others, amongst whom 
the book of Prigogine, from de Donders’s school of thermo¬ 
dynamics in Brussels, must be mentioned. Here is a list of the 
hterature: 

L. Onsager, Phys. Rev. 37, p. 405 (1931); 38, p. 2265 (1931). 

H. B. G. Casimir, Philips Research Reports, 1, 185—96 (April 1946); 

Rev. Mod. Physics, 17, p. 343 (1945). 

C. Eckhart, Phys. Rev. 58, pp. 267, 269, 919, 924 (1940). 

J. Meixner, Ann. d. Phys. (v), 39, p. 333 (1941); 41, p. 409 (1943); 

43, p. 244 (1943); Z. phys. Chem. B, 53, p. 235 (1943). 

S. R. de Groot, L'‘Effet Soret, thesis, Amsterdam (1945); Journal 

de Physique, no. 6, p. 191. 

I. Prigogine, jStude thermodynamique des phinomhnes irriversihles 

(Paris, Dunod; Li^ge, Hesoer, 1947). 
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11. (VT. p. 47.) Elementary kinetic theory of gases 

To derive equation (6.1) of the text, consider the molecules 
of a gas to be elastic balls which at impact on the wall of the 

vessel recoil without loss of" energy and 
momentum. If the yz-'p\ajxe coincides 
with the wall the a;-component of the 
momentum of a molecule is changed 
into —hence the momentum 2m^ is 
transferred to the wall. Let 
number of molecules per unit of volume 
having the velocity vector v. If one 
constructs a cylinder upon a piece of the 
wall of area unity and side v dt, all mole¬ 
cules in it will strike the part of the wall 
within the cylinder in the time-element dt; the volume of the 
cylinder is ^dt, hence the number of collisions per unit surface 
and unit time, and the total momentum transferred 

This has first to be summed over all angles of incidence (i.e. 
over a hemisphere); the result is obviously the same as one-half 
of the smn over the total sphere, namely 

where is the number of molecules per unit of volume, with a 
velocity of magmtude v (but any direction). JNTow the ‘principle 
uiolecular chaos’ is used according to which 

^ ^ = iVK 

Hence the last expression is equal to—n^v^, and the pressure is 

O 

finally obtained by summmg over all velocities 

^ ^ n m ~-s 

p =-^^v^n^=^~nv^, ( 1 ) 

The total (kinetic) energy in the volume V is 



U = —nv^V; 

hence one obtains the equation (6.1) of the text, 

Vp = %rj. 


S^i^eZ *l^®<='«^iderations of Appendix, 9, using the 
xp mnental fact expressed by Boyle’s law (that all states of a 
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gas at a fixed empirical temperature d' satisfy pV — const.). 
Then one obtains 

pV = JRT, U = IjKT, (2) 

as stated in ( 6 . 2 ) of the text. 


12. (VI. p. 50.) Statistical equilibrium 

If H depends only on p, not on x, the equation [IT, /] 

. 


reduces to 


P. 


0 X 


0, 


0 


( 1 ) 


( 2 ) 


and is equivalent to the set of ordinary differential equations 

dx dp dz 

'Px 'Py P& 

By integrating these (p is constant) one obtains the general 
solution of ( 1 ) as an arbitrary function of the integrals of ( 2 ), 
namely ^ ^ 3 ^ 

where m = XAp. (4) 

ISTow if the gas is isotropic, / can depend only on p^ and 
and if it is to be homogeneous (i.e. all properties are independent 
of x), cannot appear; hence 

/ = 0(p2) = 0,(i^), (5) 

as stated in the text. 


13. (VI. p. 51.) Maxwell’s functional equation 

To solve the equation ( 6 . 10 ) it suffices to take ^ 3 = 0 ; putting 
= X, = y, one has 


f{x-{-y) = <f>(x)</>{y). 

Differentiating partially with respect to x^ 

f'(x-i-y) = 4>'{xmy), 

and dividing by the original equation 

r{x+y) ^ <l>'{x) 
f{x-^y) <j>ix) ' 


( 1 ) 

( 2 ) 

( 3 ) 
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JSTow, the right-hand side is independent of y, and the left-hand 
side cannot therefore depend on y\ hence, putting a; = 0, 

^ = -/3. 

S(y) 

where /S is a constant. By integration, 

— ae~^y — (^) 

which is the formula (6.11) of the text. 


14. (VI. p. 52.) The method of the most probable distribu¬ 
tion 

We have to determine the probability of a distribution of 
equal particles over N cells, where n-^ of them are in the first cell, 
of them in the second, etc. (%-f-%2-|-...-j-9^Jv = n). To do this 
we first take the particles in a :^ed order; then the probability of 
distribution (%, is, according to the multiplication law, 


». .OJ1 CO2 ...CiOg. 


.ct>jyr 






5 




where co^, a> 2 ,.-., cojy^ are the relative volumes of the cells, nor¬ 
malized so that o)^ == 1. To obtain the probability 

asked for, we have to destroy the fixed order of the particles. If 
one performs first all n 1 permutations, one gets too many cases, 
as all those distributions, which differ only by permuting the 
particles in each cell, count only once. Therefore one has to 
divide n\ by the number of all these permutations inside a cell, 
that is by ni\n^\.,.n 2 ^\ The total result is the expression (6.15) 

n\ 




'fh-^ 1 ^^2 • • • * 




CO-{^CO 




( 1 ) 


which is nothing but the general term in the polynomial expan¬ 
sion 

n\ 


2! ^23'"J^Ar) 


2 


1 • • ^ 




We now deal with the approximation of n\ by Stirling’s 
formula. The simplest way to obtain it is this: write 

log(n!) = log(1.2.3...9^) = log 1-j-log 2-{-log 3-f-log w 




APPENDIX 


165 


n 

and replace tlie sum 2 ^ integral 

*=i 

7h 

f logic dx — n(\.o^n —1). 

*» 

0 

A more satisfactory derivation is the following: One can represent 
nl by an integral and evaluate it with the help of the so-called 
method of steepest descent, which plays a great part in the 
modem treatment of statistical mechanics due to Darwin and 
Fowler (see p. 54). The approximate evaluation of n I may serve 
as a simple example of this method. 

If the identity 

dx 

is integrated from 0 to cx3 and the abbreviation (F-function) 

OO 

r(n+l) = J dx (n > —1), (2) 

0 

used, one obtains r(7i-l-l) = nT(n). (3) 

As r(l) = 1, one has 

r( 2 ) = 1 . r(i) = 1, r(3) = 2 r( 2 ) = 1 . 2 , 

r(4) = 3r(3) =: 1.2.3, 

and in general ^(?^-f■l) = (^) 

The integral (2) can be written 

CX) 

T{n-\-l) — J cZcc, f{n,x) = —x-{-nlogx. (6) 
0 

The function f{n, x) (hence also the integrand) has a maximum 
where 

f'{x) — ~ 


i.e. Sbt X — n, and 

f(n) = — n-\-nlogny 
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^ neighboTzrhood of the maodmum 


fi^) 


-n+nlogn~±(x-n)^+.... 


and one has 

r(M+l) = e-»+"i°®» J 

^ VC!!-.) 

for large n. Hence 

n. r(72.-f-l) = ^/( 27 rn)e~‘^^n'^-}-.., /g) 

and log.I = 

logP ^ ^^s(^^s)+const., (8^ 

where j. \ 

/ox , '^s(logco^~log /g\ 

the text! "’®’ ®q«i-^alent to formula (6.17) of 

ditions (6."lS (6^in?tS te^SmTly ^ 

2 -^ JV 

- », 2 = Cl. 

Without using the special form of one obtains 

= ( 11 ) 

(9) Lagrangian factors. For the special function 

Sn.,~^oga.,-log»,_l, (12) 

and if this is substituted in ( 11 ) with A +1 = 

logm^ = logw^+a-^e^, 

Tig. = ct)„ e“-iS^4 - 

that is, for equal cu’ci +>ici -p i , * (13) 

qnai ct> s, the formnla (6.18) of the text. 
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If one has two sets of systems A and i?, as discussed in the text, 
there are three conditions 


iV(^) 

r=l r==l 


iVU) 


n. 


(A) 




4^>+ 


2 


n. 




U, 


and therefore instead of two multipliers three, p ; and 

one obtains, with 1 = —1 = —the formulae 
(6.19) of the text, which show that p is the equilibrium para¬ 
meter, a function of the (empirical) temperature ^ alone. 

In order to see that p is reciprocal to the absolute tempera¬ 
ture one must apply the second theorem of thermodynamics, 
which refers to quasi-static processes involving external work 
(for instance by changing the volume). 

By an infinitely slow change of external parameters 
the energies of the cells will be altered and at the same time 
the occupation numbers n,.; the total energy will be changed by 


dU — 2 2 2 ^rdn^. 


while the total number of particles is unchanged, 

dn — dn^ — 0. 

T 

The first term in (14) represents the total work done 

dW ^ -n-^f.da,. 


where 



i V TO 
nZ^ ’^aa„ 

r ^ 


(14) 

(15) 

(16) 
(17) 


is the average force resisting a ehange of a„. Then the second 
term in (14) dQ = J,.^dr., (18) 


must represent the heat produced by the rearrangements of the 
systems over the cells. 

The corresponding change of logP is obtained from (8) 
and (11), 

dlog P = 2 = 2 (A+jSe,)dTO,, 

r ^ 

which in virtue of (15) and (18) reduces to 


<ZlogP = p dQ. 


( 19 ) 
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This shows that pdQ is a, total differential of a function depend¬ 
ing on a^, and that is the integrating factor. 

Hence the second law of thermodynamics is automatically 
satisfied by the statistical assembly, and one has, with the 
notations of section V, 

dQ — with A = ^ ^ = logP; 

p 

then (5.25) and (5.26) give, with G = l/h, logO =0, ^ = 1, 
and (5.27) ^ 

= H = ^ log P. (20) 

h is called Boltzmann’s constant. 

How the change of energy (14) becomes 

dU = -n^f^da^-J^TdJS. (21) 

If one has a fluid with the only parameter the corre¬ 

sponding force is the pressure 




2 d€ 


( 22 ) 


and one obtains the usual equation 

dU = —pdV-^TdS, (23) 

one general expression (21) one sees easily that 

function Tor quantities in terms of the so-called partition 

junction (or sum-over-states’) ^ 

-Z = X (24) 

For, from (10) and (13), 

w == e“ 2 ojyC-^^r = e<^Z, 


^ ~ 2 


hence 




,, “ = l0gM—log^. 

(8)7(9U17V*’ ealeulation, from (19), (20) with 


u = 


aiog^ 
dp ’ 




( 25 ) 
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and du= — ^f^da^+Tds, (26) 

a 

The simplest thermodynamical function, from which all 
.others can he derived by differentiation, is the free energy, 

f=u~Ts=^ —JcTlogZ, 

df^ -J^fada^-sdT^ 

a 


(27) 


hence 


fa 


da„ da^ 


while s = —dfjdT leads back to the second formula (25). 

The application to ideal gases may be illustrated by the simp¬ 
lest model where each particle is regarded as a mass point with 
coordinates x, y, z, momenta py, p^, and mass m. Then, 
according to Liouville’s theorem, one has to take as cells co^ 
elements of the phase space dxdydzdp^dpydp^ and replace the 
sums by integrals. The energy is {p%-\-py-\-p%)l^m. Then the par¬ 
tition function (24) becomes 

Z — j j dxdydzdp^ dpy dp^. 

The integration over the space coordinates gives F, the volume. 
If one puts one has 

J2m\% 


Z 




7 /// 


didrjdCy 


the integration extended for each variable from —oo to + 00 . 
The integral is a constant which is of no interest as all physical 
quantities depend on derivatives of Z. Hence, with = {kT)~^, 

f = —IcTlog Z = —JcT log V —f/^Tlog/cT'-l-const., 
from which one obtains 


P 


■n 


3f nhT 


s = 


_ 


hlog FH-p(log kT+1), 


dV V ’ " dT 

u = f-l-Ts = const. 

These are the well-known formulae for an ideal monatomic gas: 
Boyle’s law, the entropy and energy per atom. The specific 
heat at constant volume is 


dU 

dT 


^nh = f jK, 


if n refers to one mole. 
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15. (VI. p. 54.) The method of mean values 

The method of Darwin and Fowler aims at computing the 
mean value of any quantitydepending on the occupation 
number of a cell for all possible distributions 
satisfying the conditions 


T 7 * 

that is, the quantity 

where P is the probability of the distribution %, defined 

by equation (6.15) or 14, (1). 

We consider the function jF(z) defined l^y (6.22), 

F{z) = C 0 iZ^^-i-C 02 Z^^-i-..,-{-ct>^Z^^, (3) 

and assume that a very small unit of energy is chosen so that all 
t e Cy are positive integers, which may be ordered in such a way 

^ ^ ^2 ^3 ^ also, by choosing the zero of energy 

smtably we can arrange that cj = 0. 

Then we expand {F{z)y>' into powers of z according to the 
m momial theorem and obtain a series of terms 
n\ 

= P(Wi,M.2,..., 

these terms with the same factor we obtain all 

u^A'^) which belong to the same value of 


r 


way theToS]^nr^K^K*r5^^ in (2) and obtain in this 

given total energy ^,t the fSmf ^ 


= coefficient of in {P'(z)} 


/n 


theorem., if . is 


27ri J ^ (4) 

^here the integral is taken round a closed contour surrounding 
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tlie origin 0 in the 2:-plane. The integral can be evaluated 

by the method of steepest descent which we have 
already explained, for real variables, in Appendix 14, for the 
F-function. 

The first step is to express the integrand in the form 

0{z) =: nlogF{z)—{U-\-l)\o%z. 


Since 



and lim\ 


w 




both log F(z) and its derivative increase monotonically from a 
finite value to cx) as 2 : moves along the real axis between 0 and 
CO- Also —logs: and its negative derivative z~^ decrease mono¬ 
tonically along the same path. Hence Q{z) can have only one 
extremum, a minimum, on the real axis between 0 and 00 , and 
this minimum will be extremely steep if n and U are large. 

Also let Zq be the point of the real axis where the minirmiTn 
happens to be; then at this point the first derivative of G{z) 
vanishes and the second is positive and very large. Hence in the 
direction orthogonal to the axis the integrand must have a very 
sharp maximum. If we take as contour of integration a circle 
about 0 through Zq, only the immediate neighbourhood of this 
point will contribute appreciably to the integral. 

The minimum Zq is to be found as root of the equation 


G^{z) = 


Z7+1 F'(z) 

z + 


0 


(5) 


and one has 


G"{z) 




This shows that for large U and n a proportional increase in U 
and n will not change the root Zq, while G"{zq), which is positive, 
can be made arbitrarily large. 

5131 
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Putting 2 = one obtains for the integral (4) 

OO 

y p = r dy, 

(.U) 277^ J 

— OO 

where the terms of higher than second order are omitted and the 
limits of integration are taken to be d=°o because of the sharp 
drop of the exponential function. This gives 


(t/) 




( 6 ) 


^{27tO"{Zq)} 

XJ-\-l can be replaced by TJ^ because of the smallness of the energy 
unit chosen: if one puts 

(7) 

= (S) 


Zq — e-P 


one has, for N -> ao. 


which shows that the function F(z) is equivalent to the partition 
function introduced in (14.24), p. 158. 

If one now takes the logarithm of (6) the leading terms are 

log 2 -P = ^Uy-n log 

CT7) 


On the other hand, one has from (5) to the same approximation 


TT P'M 


n dZ 


Z d^ 

in agreement with (14.25); hence 


n 


dlog Z 
~d^- 


(9) 


log^|^P = «(log2-;S^^|^V (10) 

Comparison with (14.25) shows that the entropy in this theory 
is to be defined by 

S — ns ~ k\og,'^ P, (11) 

(Z7) 

while m the Appendix, 14, the definition was S ~ Jc log P, where 
P means the maximum value of the probability. 

Thus it becomes clear that owing to the enormous sharpness of 
the maximum it does not matter whether one averages over all 
states or picks out only the state of maximum probability. 
In fact, the two methods, that of the most probable distribution 
^d that of mean values, do not differ as much as it appears. 
Both use asymptotic approximations for the combinatorial 
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quantities: either for each factorial in the probability before 
averaging, or for the resultant integral after averaging. The 
results are completely identical. Yet there are apostles and dis¬ 
ciples for each of the two doctrines who regard their creed as the 
only orthodox one. In my opinion it is just a question of training 
and practice which formahsm is more convenient. The method 
of Darwin and Fowler has perhaps the advantage of greater 
flexibility. The partition function is nothing but a ‘'generating 
function’ for the probabilities, and allows the representation of 
these by complex integrals. In this way the powerful methods 
of the theory of analytic functions of complex integrals can be 
utilized for thermodynamics. 

16. (VI. p. 66.) Boltzmann’s collision integral 

The coUision integral (6.24) can be derived in the following way. 

The gas is supposed to be so 
diluted that only binary en¬ 
counters are to be taken into 
account. Then the relative 
motion of two colliding particles 
has an initial and a final straight 
line asymptote. 

To specify an encounter we 
define the "cross-section’ as the 
plane through a point O with a 
normal parallel to the relative velocity —^2 of two particles 
before an encounter and introduce the position vector b in this 
plane. We erect a cylindrical volume element over the area db 
with the height —^ 2 ! then all particles in this element 

having the relative velocity —^2 pass through c^b in time 

dt. The probabihty of a particle 2 passing a particle 1 at O 
within the cross-section element dh is obtained from the 
product/(I) d'X.-^dZ,-^f{2) cZx 2 cZ ^2 replacing tZXg by the volume 
of the cylinder 1^^—^ 2 ! d,hdt, 

/(l)/(2)|?i-?2l dlydiL^dg^d-g^dt. 

Every encounter changes the velocities and removes therefore 
the particle 1 from the initial range. The total loss is obtained 
by integrating over all dh and d%^i 

dx^dg^dt JJ f{l)f(2)\g^-g^\dbdg^. 



( 1 ) 
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But there are other encounters such that the final state of 
particle 1 is in the element they are called inverse 

counters. 

If the final velocities of the direct encounters are ^2 
laws of collision (conservation of momentum and energy) al 
one to express ^2 in terms of l^wo further paramei 

(the components of b in the cross-section plane). These relati 
are linear in ^2 and may be shortly written 

(ri, ?2) = 

where ^ represents a 6 x 6 matrix. It is obvious that the s< 
tions of these equations for and in terms of and ^2 1x1 
have the same form; that means that so that = 

and j = 1 or 

Further, the elementary theory of collisions (conservation 
energy and momentum) implies 


l?i-?2l = |?X-?2 
Hence the number of inverse encounters is 

ff /'(l)/'(2)|5i-5,| dbd^^, 

where / (1) means being the linear funotion 

Si, ^2 given by ( 2 ). 

obtains for the total gain of* p 
tides (l)mdx^ per time-element dt, 

d^^d^^dt fj {/'(1)/'(2)-/(1)/(2)}|^,_5^| dbd^,. 

calculated witlxc 

^ dx,d?,dt = d^^d^^dt. 

^e results are the combined formulae (6.23) and (6.24) of t 


8 /( 1 ) 

dt 


-W/(l)] = ff {/'(l)/'(2) 


—/(l)/(2)}|?i—dbd^s. 
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17. (VI. p. 57.) Irreversibility in gases 

Assuming no external forces, the Hamiltonian of a particle 
1 

is jET = ^—^2 ]gQl^ 2 ;mann’s equation (6,23), or (Appendix, 

16.8) reduces to 

( 1 ) 

If now the entropy is defined by (6.25) or, using the velocity 
instead of the momentum, by 

/(l)log/(l) (2) 

one obtains 

^ = -fc JJ {l+log/(l)}§^d!x,£Z?„ 

and substituting df{l)ldt from (1) 

M {l+log/(l)}{/'(l)/'(2)-/(l)/(2)}x 

X\%^-TiAdhdic^d%^d%,. (3) 

Here the first integral can be written 

and transformed, by Gauss’s theorem, into a surface integral 
over the walls of the container, 

jdcy 

where v is the unit vector parallel to the outer normal of the 
surface, da the surface element. 

The inner integral is n times the mean value over all velocities 
of ^i.vlog/(l), where n = J /(I) is the number density. But 
this average vanishes at the surface which is supposed to be 
perfectly elastic and at rest, external interference being excluded; 
for the numbers of incident and reflected particles with the same 


J ?i.v/(l)log/(l)(i?i. 
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absolute value of the normal component of the velocity, 
will be equal. 

Hence there remains only the second integral in (3). This can 
be written in four different forms, namely, apart from the one 
given in (3), where the factor 14-/(1-) appears, three others where 
this factor is replaced by l+/(2) or 1H-/'(1) or 14 -/'( 2 ). Hor it is 
obvious that 1 and 2 can be interchanged as the integration is 
extended over both points in a symmetric way; and the dashed 
variables can be exchanged with the undashed ones as 

!ri-r2i = 

(see Appendix, 16.3, 4). Hence 

JJJJ {log/(l)+log/(2)-log/'(l)-log/'(2)}X 
x{/'(l)/'(2)-/(l)/(2)}|5i-?2| 


or 


dt 


h 

4 


X l?i —?2l 

How log ^ is positive or negative according as 
/(l)/(2) 

is greater or smaller than /(l)/(2); it has therefore always the 
same sign as /'(I)/'(2)—/(!)/(2), and one obtains 

^>0; (5) 

dt 


the — sign can hold only if 

/'(l)/'(2) = /(l)/(2) (6) 

or log/'(l)+log/'(2) = log/(l)+log/(2). (V) 

One can express this also by saying that 

log/(l)+log/(2) (S) 

is a colhsion invariant. 

The mechanics of the two-body problem teaches that there are 
only four quantities conserved at a collision: the three com¬ 
ponents of the momentum the total energy 

Hence log / must be a linear combination of these: 

log/= a—(^) 


or 
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This can also be written 

f == 

1 

jY and «! = a-f- 


( 10 ) 


where u ^ , -- jl - > o o 

mp * 2mp 

(10) shows that u is the mean velocity. For a gas at rest (in a 

fixed vessel) one has therefore u = 0, hence y == 0 and 

f = (11) 

This is the dynamical proof of Maxwell’s distribution law. 


18. (VI- p. 60.) Formalism of statistical mechanics 

As said in the text, Gibbs’s statistical mechanics is formally 
identical with Boltzmann’s theory of gases if the actual gas is 
replaced by a virtual assembly of copies of the system under 
consideration. Hence all formulae referring to averages per 
particle (small letters) can be taken over if the word ‘particle’ 
is replaced by ‘system under consideration’. One forms the 
partition function (14.24) or (15.8) F{z) = Z{p), z = e~^, and 
from that the free energy (14.27) 

f=-hTlogZ, ( 1 ) 

from which all thermodynamical quantities can be obtained 


by differentiation: 

dT" da„' 


( 2 ) 


This formalism includes also the case of chemical mixtures 
where the number of particles of a certain type is variable. One 
has to know how the quantity Z depends on these numbers; 
then the chemical potentials, introduced in Appendix, 8, are 
obtained by differentiating f with respect to the concentrations. 
We shall mention only the method of the ‘great ensemble’ which 
can be used in this case. 

In the theory of non-ideal gases the Hamiltonian splits up 
into a sum 

H p|+!7(Xi,X2,...,Xjv), (3) 

2m 


and the partition function into a product 

^ = J J exp| —(j5/2m)^2^p|} 


X J Jexp{~iSC7(Xi,...,x^)} dXi...dx^. 
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The first integral can easily be evaluated and gives 

~ {27TmJcTY^I^\ 


hence one has 
where 


^ = {2'nmkTf^l^Q, 


Q = ^ (3^) J exp{—Z7(Xi,...,Xiv)/^^} 


(4) 

(5) 


as in (6.35) of the text. 

The method of Ursell for the evaluation of this integral applies 
to the case where the potential energy is supposed to consist of 
interaction in pairs between the centres of the particles, 

<!),, = (6) 

i>j 

Then one can write 


where 


= n =n 

i>j %>j 


fi 


%3 


The product (7) can be expanded into a series 

= 1— 2/i3'+ 2./i3 


i>j 


i>J 

h>k 


(7) 

( 8 ) 

( 9 ) 


and the problem of calculating Q is reduced to finding the 
‘cluster integrals’ 

J...J j ...j (10) 

which are obviously proportional to V^~^, V^-^, _ Hence one 

obtains for QV~^ an expansion in powers of V~^ which holds 
for small interactions small implies small): 

a , ^ 


■■■) 


Q = V^( 1 
^ I Y ^ yz 

Then (1) and (4) give 

/ = -i:r/^log(2,Tm*r)+loge|, 


( 11 ) 


( 12 ) 


hence 


P 




8V 


j.rp^'^ogQ _NkTI A B \ . 

fcj. p I'*’ y~^ yz ‘’-y 


where .4 = a/iV', B = (a^— 2p)IN, That is the formula (6.36) 

given in the text. 

The actual evaluation of the cluster integrals is extremely 
difficult and cumbersome. The analytical properties of the power 
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series (11) have been carefully investigated by J. Mayer, and by 
myself in collaboration with K. Fuchs. The theory has been 
generalized so as to include quantum effects by XJhlenbeck, 
Kahn, de Boer. Here is a list of publications: 

H. D. XJrsell, Proc. Camb. Phil. jSoc. 23, p. 685 (1927). 

J. E. Mayer, J. Chem. Phya. 5, p. 67 (1937). 

J. B. Mayer and P. J. Ackermann, ibid. P- 74. 

J. E, Mayer and S. F. Harrison, ibid. 6, pp. 87, 101 (1938). 

M. Bom, Physica, 4, p. 1034 (1937). 

M. Born and K. Fuchs, Proc. Boy. Soc. A, 166, p. 391 (1938). 

K. Fuchs, ibid. A, 179, p. 340 (1942). 

B. Kahn and G. E. XJhlenbeck, Physica, 4, p. 299 (1938). 
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J. de Boer and A. Michels, Physica^ 6, p. 97 (1939). 
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W. G. MacMillan and J. E. Mayer, ibid. 13, p. 276 (1946). 
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19. (VI. p. 62.) Quasi-periodicity 

The state of a mechanical system can be represented by a 
point in the 6iV-dimensional phase space p, q, and its motion by 
a single orbit on a ‘surface’ of constant energy in this space. 
Following this orbit one must come very near to the initial point; 
the time needed will be considerable, in the range of observa¬ 
bility. This is the quasi-period considered by Zermelo. Yet 
there are much smaller quasi-periods if one takes into account 
that all particles are equal and indistinguishable; the gas is 
already in almost the same state as the initial one if any particle 
has come near the initial position of any other. Then the orbit 
defined above is not closed at all, yet the system has performed 
another kind of quasi-period. These periods are presumably 
small; I cannot give a mathematical proof, but it seems evident 
from the overwhelming probability of distributions near the 
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most probable one. Einstein has this quasi-period in mind. It 
is certainly extremely short in the scale of observable time- 
intervals, and one can therefore say that the representing “'point’ 
sweeps over the whole energy surface if this point is defined 
without regard to the individuality of the particles (i.e. if an 
enormous number of single points corresponding to permutation 
of all particles are regarded as one point). 


20. (VI. p. 63.) Fluctuations and Brownian motion 

The statistical conception of matter in bulk implies that 
spontaneous deviations from equilibrium are possible. There are 
several different types of problems, some of them concerned 
with the deviations from the average or fluctuations found by 
repeated observations, others with actual motion of suspended 
visible particles—^the Brownian motion. 

The simplest case of fluctuations is that of density, i.e. of the 
number of particles in a small part coF of the whole volume F. 
One has in this case two cells of relative size a> and 1 —a>, and the 
probability of a distribution is according to 

(6.15) or (14,1) 

/yi I 

TF(?^l) =—~—^-—cu”‘(l — a>)”'~”^^. (1) 

n^\{n —Til)! 

The expectation value of found by repeated experiments is 


^1 = 


= 2 = V —r 


n-j^n\ 


or 


ni==o 


{n —?^l)! 


co”^(l — ca) 


n—m 


n 


n 


Tlco 


(91—1)! 


— ^^(71—1)—(m—1) 


(«■! — !)!{(»* — 1) — K— 1)}! 

According to the binomial theorem this reduces to 


= tlco, (2) 

as might be expected. 

In order to calculate n\ we note that — 1) can be found in 
exactly the same way as namely. 


^i(^i 1) 


== 2 
m—0 

■n 

= Ji(re—1)0)2 y-_ 


X 


X a>’^^“2(l — oi>Y'ri-^)-{nx-2)^ 
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■whence ^i(^i—1) == —1)^0^. (3) 

Therefore 

n\ = n-^in^ —l)-[-^i == n{n —l)ct>2+?ia>, (4) 

so that the mean square deviation is 

= (72-1— n-^'^ — n \—(%)^ = ncxi — nco^ — na>{l —o)). (5) 

If ct> is a small fraction, one obtains the well-kno'WTL fluctuation 
formula for independent events 

— %. ( 6 ) 

This is directly applicable to the density fluctuation of an ideal 
gas and can be used to explain the scattering of light by a gas, 
as observed for instance in the blue of the sky (Lord Rayleigh; 
Atomic Physics, Appendix IV, p. 280). 

There are also fluctuations of other properties of a gas. As 
the state of a fluid is determined by t-vvo independent macro¬ 
scopic variables {p, V for instance), it suffices to calculate the 
fluctuation of one further quantity. The most convenient one 
is the energy. 

The following consideration holds, however, not only for ideal 
gases, but for any set of independent equal systems of given 
total (or mean) energy; it supposes only that the distribution 
is canonical. 

Then all averages can be obtained with the help of the par¬ 
tition function (14.24) or (15.8), 

= ^ = (T) 


In particular the mean energy is (14.25) 

2 <Ur /^/ 

r 

and the mean of its square 

_ 2 

^2 jr ___ _ ri- 


dlogZ 
dp ’ 


( 9 ) 
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Hence tlie mean fliictnation of energy 

_ ZZ"-~Z 

“ z^ 

or with (8) j 


'2 ^ 


du 

dp' 


( 10 ) 


If the mean energy is loiown as a function of temperature, 
hence of p, one obtains its fluctuation by differentiation. For 
example, for an ideal gas one has u = c^T = hence 

(Ae)^ = hT^ — {Jclc^)u^. Another appHcation, to the fluctua¬ 
tion of radiation, is made in section VIII, p. 79. 

If one wishes to determine the fluctuations of a part of a body 
which cannot be decomposed into independent systems, these 
simple methods are not applicable. 

Einstein has invented a most ingenious method which can be 
applied in such cases. It consists in reversing Holtzmann’s 
equation S = fclogP, (11) 

taking as a known function of observable parameters, and 
determining the probabihty P from it, 

P = (12) 

Assume the whole system is divided into N small, but still 
macroscopic parts and A% is the fluctuation of energy in one of 
them; then one has for the entropy in this part 

If the whole system is adiabatically isolated one has 

2 = 0. 

By adding up all fluctuations one gets for the entropy 

S (13) 


where the abbreviations 


l(d^S\ 


ii — Aui 


(14) 
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are used. According to the second law of thermodynamics one 
has for constant volume 


dS = 


dU 


dU ~ 


hence 

y = - 


2Jc[dU^), 


2k8U\T} 


± 11 ] 

1 8T\T} 


2k 8U 2kT^c^’ 


BT 


where — dUjdT is the specific heat for constant volume. 
Substituting (13) in (12) one obtains approximately 

P = 

hence the mean square fluctuation of energy in one (macroscopic) 
cell is 


OO 


(i\UiY = ft 


f if e-y(i di^ 

— OO 


OO 


/ e-y^i di, 

-OO 


A 

dy 


OO 


log J e-y^i dii 

— 00 


2y’ 


or (Az^7)2 = IcT^c^. (16) 

This result is formally identical with that for an ideal gas ob¬ 
tained above, yet holds also if is any function of T. 

In a similar way other fluctuations can be expressed in terms 
of macroscopic quantities. 

We now turn to the theory of Brownian motion which is also 
due to Einstein. His original papers on this subject are collected 
in a small volume Investigations on the Theory of the Brownian 
Movement^ by R. Fiirth, translated by A. H. Cowper (Methuen 
& Co. Ltd., London, 1926) and make dehghtful reading- Here 
I give the main ideas of this theory in a slight modification 
formulated independently by Planck and Fokker. 

Let f{x, t) dx be the probability that the centre of a colloidal 
(visible) particle has an a;-coordinate between x and x-\-dt at 
time t. The particle may be subject to a constant force F and 
to the collisions of the surrounding molecules. The latter will 
produce a friction-like effect; if the particle is big compared with 
the molecules, its acceleration may be neglected and the velocity 
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component in the ir-direction assumed to be proportional to the 
force I — (17) 


where B is called the 'mobilityApart from this quasi-oontimi- 
ous action, the collisions will produce tiny irregular displace- 
iii6iit»s wliicli csiin l 36 dcscribod by 3^ st/^tistical law, ns/incly, 

a fimetion <j>{x) which, represents the probabiUty for a 
particle to be displaced in the positive ic-direction by x during a 

small but finite interval of time t. 

Then one obtains a kind of coUision equation (which is simpler 
than Boltzmann’s in the kinetic theory of gases, as no attempt 
is made to analyse the mechanism of collision in detail): The con¬ 


vective increase of/(a;, t) in the time-interval 


T, T 


dt 


{ 


dt^dx^J 


is not zero but equal to the difference of the effect of the collisions 
which carry a particle from x-^ to x and those which remove the 
particle at x to any other place 


T 



00 

— 00 


— 00 

^{x) may be normalized to unity and the mean of the displace¬ 
ment and of its square introduced by 

oo OO GO 

J cj>{x) dx — ly J x<j>{x) dx — Aa;, J x^(l>{x) dx = (Aa;)^- 

— CO —QO —oo 

(19) 

Further it may be supposed that the range of <j>{x) is small; then 
one can expand/(a;—a;') on the right-hand side of (18) and obtain 
a differential equation for/(£c, ^) which, with (17) and (19), can 
be written 


\ T / dx 2 T Bx'^ 


( 20 ) 


Let us assume that the irregular action of the collisions is 
symmetric in a:, ^{x) = <t>{--x); then A^ = 0. 
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Consider first statistical equilibrium; then 


BF 




dx 2 t 






0. 


( 21 ) 


Now the coordinate x of the colloidal particle can be included in 
the total set of coordinates of the whole system, if a term —Fx 
is added to the Hamiltonian, so that the canonical law of distri¬ 
bution contains the factor = l/JcT. Hence the solution 

of (21) must have the form 


so that 




8x^ 



If this is substituted in (21), one finds 


( 22 ) 


D 


(Ax)^ 

..2r. 


B 

P 


7cTB. 


(23) 


We consider now the motion of the particles without an 
external field (F — 0), under the action of the collisions only. 


Then (20) reads 


dt 


D 


dx^ 


(24) 


This is the well-known equation of diffusion. Einstein’s main 
result consists in the double formula (23) which connects the 
mean square displacement with the coefficient of diffusion I) and 
with temperature and mobility. 

If the particle is known to be at a given position, say a: = 0, 
at i = 0, the probability of finding it at x after the time t is the 
following solution of (24): 


f{x, t) 


^{4:7TJDt) 


(25) 


the mean square of the coordinate, or the ‘spread’ of probability 
after the time t is found by a simple calculation: 


{Aa;(i)p = J t) dx — 2Dty (26) 

— CXD 

which for ^ = r is equal to the mean square displacement (Aa;)^ 
given by (23). 

These formulae can be used in different ways to determine 



176 


APPENDIX 


Boltzmann’s constant Tc^ or Avogadro’s number N = (where 
i 2 is the gas constant per mole), i.e. the number of molecules per 
mole. A static method consists in observing the sedimentation 
under gravity of a colloid solution; then W = — mg^ where m is 
the mass of the colloid particle, and the number of particles 
decreases with height according to the law ( 22 ), which now reads 


In order to apply this formula one has to determine the mass. 
Bor spherical particles m = ( 47 T/ 3 )r®p, where p is the density and 
T the radius. The mobility of a sphere in a liquid of viscosity rj 
has been calculated by Stokes from the hydrodynamical equa¬ 
tions, with the result , 


hence it falls under gravity, F — — mg ~ —( 47 T/ 3 )r®pg', with 
the velocity (17) ^ 

S = —BF=~^rK 

9 7 ) 

As $ can be measured, r can be found, if p and 77 are known, and 
finally m. 

Another method is a dynamical one. One observes the dis¬ 
placements AiTi, AiCg,..., of a single colloid particle in equal inter¬ 
vals T of time and forms the mean square (Aa;)^. Then using the 
same method as just described for determining the radius r, 
one finds B from (27) and then h from (23). 

In this way the first reliable determinations of N have been 
made. Among those who have developed the theory M. v. 
Smoluchowski has played a distinguished part, while the first 
systematic measurements are due to J. Perrin. 

A new and interesting approach to the theory of Brownian 
motion may be mentioned: J. G. Kirkwood, J. Chem. Phys. 
14, p. 180 (1946); 15, p. 72 (1947). 


21. (VI. p. 67.) Reduction of the multiple distribution 
function 

The total Hamiltonian of N particles can be split into two 
parts, the first being the Hamiltonian of N~ 1 particles, 

the second the interaction of these with the last particle: 


( 1 ) 
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where is the external potential on the particle i and the 
mutual potential between two particles i and j. 

Now we apply the operator xn equation for the total 

system 

% = [^iv./iv]- (2) 


From (6.40) we have, for q = JSf —1, 


Hence 


Xjv/n — /n- 


Xn 




N 


dt 


e 

at 


(Xiv/v) 


^fl 


N~1 


at 


and 

XnIBM = XA-[^fA-i./iv]+XN[(l/2»»)P<"»+®<^,/^]+ 


Here the first term on the right-hand side becomes 

XivC-^iv—ij/iv] “ {.^n—i^Xn/nIi “ i1j 

since does not depend on the particle N to which, the opera- 
tor Xn I’^fers. Further, 




V (~ _ V(N) ^fN \ _ Q 


for if the integration xn performed, the result refers to values 
of/^ at infinity of the and respectively, and these vanish 
as there is no probability for particles to be at an infinite distance 
or to have infinite velocities. 

If all this is substituted in (2) one obtains 




N~\ 


at 




(3) 


Repeating the same process with Xn-^* Xv— 2 >-*- obtains the 

chain of equations (6.44), (6.45) of the text. 


22. (VI. p. 68.) Construction of the multiple distribution 
function 

The fundamental multiplication theorem for non-independent 
events can be obtained in the following way. 

Any event of a given set may have a certain property A or 

6131 N 
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not, A. If B is another property we indicate by AB those events 
which have both the properties A and B. _ _ 

Then all events can be split into four groups AB, AB, AB, 
AB, with the probabihties Pab> PJb’ Pab- 
The probability of A is 

= P^+Pab- _ _ _ 

On the other hand, if >4 is known to occur, the cases AB, AB are 
excluded, hence the probability of B is 


or 


Pb^^) = 


Pab 

Pab~^Pab 


_ Pab 
Pa ' 


PaPb^^) == Pab^ 


( 2 ) 


which is the multiplication rule; it reduces to the ordinary one 
for independent events if pp(JL) does not depend on A and is 
equal to 

This rule can be applied to a mechanical system of N particles 
in the following way. 

Let A signify that q particles are in given elements of phase 
space; the probabihty of A can be written 

Pa. = /a (3) 

Let B mean that the element g-f-1 is occupied. Then AB ex¬ 
presses that all q -\-1 elements are occupied, or 

Pab == /a+i (4) 


Hence p^{A), the probability for the element g-f-1 being occu¬ 
pied, if q particles are in given elements, is 


=SdS = fsM . (5) 

fa 

If this is summed over all possible positions and velocities of the 
last particle (g'H-1), the result is equal to the number of particles 
excluding the q fixed ones, JV— q; hence, with the normalization 
described in the text, (6.42) and (6.43), 

(N-q)f^ = J J /<,„dx<*+i)d§to+« = (6) 

which is the formula (6.40) of the text. 

In order to construct the equation (6.44) for the rate of change 
of fq, one has to introduce a generalized distribution function 
which depends not only on the position x and the velocity % 
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but also on the acceleration rj of the particles; the probability 
for a set of g particles to be in the element 

shall be denoted by 


g^{t, xw, Y]W xto), yjto)) d:x<%^(l)^^YJ(l)...dxto)d5(9)c^yJ(ff). 
One has obviously 

/ ® J g^drf>^...dn^<i> = /j. (7) 

Now the motion of the molecules follows causal laws; hence 
the probability/^ of a configuration in x, ^-space at a time t must 
be the same as that at the time of that configuration which 
is obtained from the first by substituting xW-{-S(»)S« and £(^>-4- 
for x^^^ and 

Hence (7) leads to 

J j* 5^, 7j) drp-K..dYf^^ 

/ ••• J = t)- («) 

The integration in the first two terms can be performed with 
the help of (7); that of the last leads to the integral 

/ J garimri<^K..dy{^) = (9) 

where the symbol vj^ is evidently the mean acceleration. 

Hence one obtains from (8) 



The final step consists in using the laws of mechanics for deter¬ 
mining yj®. The equations of motion are (force 



1 

m\-<w 

'd ““ X 



( 11 ) 


Now the function refers to the case where the positions and 
velocities of q particles are given, the others unknown. Hence 
one has to split the sum (11) into two parts, the first referring to 
the given particles, the second to the rest. For this rest the 
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probability of finding a particle in a given element g -j-1 is known, 
namely (fQ+Jfg) hence the average of this sum can 

be determined by integrating over i.e. by applying 

the operator Xg+i* fi^is way the mean acceleration is found to be 



1 



Substituting this in (10), one obtains 



0x(»> ' 




1 3<D(«) 

'3 = 1 

+-S 


.pco 4_ 


/a<I)(i,«+l) 





whichis easily confirmed to be identical with the formulae (6.44), 
(6.45) of the text. 


23. (VI. p. 69.) Derivation of the collision integral from the 
general theory of fluids 

From the standpoint of statistical theory a fluid differs from 
a sohd by the absence of a long-range order, so that for two events 
A and B happening a long distance apart one has, with the 
notations of Appendix, 22, = Pa-Pb^ ^^r instance, for large 

jx<2 )—one has/ 2 (x^^^ x^2>) =/i(x(^>)/i(x(2^), while in solids 
this is not the case. 

The distinction between liquid and gas is not so sharp and 
may even he said to disappear above the critical state. However, 
if one is not specially concerned with these intermediate con¬ 
ditions there is a wide region where liquid and gas can be dis¬ 
tinguished by the extreme difference of density. From the 
atomistic standpoint this has to be formulated thus: 

The potential energy <l>(x^^\ x^’^) between two molecules at 
and x<?^ decreases rapidly with the distance between their 
mass centres, and (except in the case of ions, where Coulomb 
forces act) a distance r^, small by macroscopic standards, may 
be specified, beyond which the interaction may without error 
be assumed to vanish completely. In a liquid proper, there are 
many molecules wdthin this distance Tq of a given molecule; in a 
gas there are usually none, and the probability that there is more 
than one is very small, except near condensation. The neglect 
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of this small probability is equivalent to the assumption of 
'binary encounters’ in gas-theory. Green has shown that when 
this assumption is made, on taking ^ = 1 in the equations (6.44) 
and (6.46) of the text, 




(1) 

( 2 ) 


one obtains Boltzmann’s collision equations (6.46), (6.47). 

To prove this we first work out the expression using the 
definition (6.6) of the Poisson bracket and of the operator x> 
(6.39). ^ ^ ^ Qj; 




i/J 


ax<i) * 




(3) 


(see also 22.13). With the assumption of binary encounters/g 
can be expressed in terms of /j by using the mechanical laws of 
collision- 

Consider the motion of two molecules which at time t have 
positions x<^>, x<2)^ such that |x<2)_x(i>[ < and velocities 
^(i)^^( 2 )^ while at time Iq (<; ^), when the molecules were last at a 
distance from another, their positions and velocities were 
x^^^ and The configurational probability 

fS, x(i), x(2), ^<i), ^(2)) dx<i)^Zx(2)d!^<i)d^(2) 

must remain unchanged during the interval {tQ,t) as the motion 
follows a causal law; also, by Liouville’s theorem, the volume in 
phase space jg unaltered. Since, as explained 

above, molecular events in fluids which occur beyond the range 
of interaction must be considered independent, one has 

fs, xW, x(2>, ^a>, ^<2)) = /,(«„ x<^), ^<1>) A(^o, x<2), ^^2)). (4) 

Next one introduces an approximate assumption which is 
always made in gas-theory, that ^q, xj,^^, xj,^^ may be replaced by 
t, x^^^ and x<2^ on the right-hand side of (4) (but of course not 

As Vq is very small the resulting error is of 
microscopic order; nevertheless it is not without importance, 
for it allows small deviations from Maxwell’s velocity distribu¬ 
tion law (and other ‘fluctuations’), which would otherwise be 
unexplainable, as this law is a rigorous consequence of Boltz¬ 
mann’s collision equation in equilibrium conditions. 

It remains to calculate and in terms of and 
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r = which can be done by using the canonical equations 

of motion or their independent integrals (conservation of energy, 
momentum, and angular momentum). The resulting formulae 
are the same as used in Boltzmann’s theory (see Appendix, 16). 
The reduction of >^2 can be performed without making use of 
explicit expressions. One has only to remark that 

/2(^,Xa),x(2),^(l), ^(2)) 

now satisfies the equation 

[■^25/2] = (^) 

where -£4 = (6) 

is the Hamiltonian of the two particles which are considered to 
move independently of all the others. Kow (5) becomes 

(7) 

We integrate this over <^x<2)d^^(2); then the term with on 

the right-hand side vanishes, because there are no particles with 
infinite velocities. The other term, with becomes identi¬ 

cal with m/Si, according to (3), since 

ao _ ao 

ar 

Hence, with (4), 

== JJ (8) 

where the domain of integration over r may be limited by the 
sphere of radius Tq surrounding x^^^. 

This integration can be performed by imagining the sphere to 
be partitioned by elementary tubes parallel to the relative 
velocity ^^ 2 )—^(i). may then integrate, first over a typical 
tube specified by the cross-section radius b, perpendicular from 
the centre of the sphere to the tube (see Appendix, 16), and then 
over all values of b. At the beginning of the tube, where 

(?(2)-?(!)).r < 0, 

the interaction between the molecules is negligible, and the 
functions giving and ^[,2) in terms of ^<2), and r reduce to 
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and At the end of the tube the values and of these 
functions have to be calculated from the collision integrals, just 
as in Boltzmann’s theory. Thus one obtains 

Sx = JJ l5'«-5®l{/i(5<«')/i(r>')-/i(r>)A(5®)} dhd^^<‘\ (9) 

which is identical with (6.47) of the text and the collision integral 
in (16.8), 

This derivation is not more complicated than Boltzmann’s 
original one and is preferable because it reveals clearly the 
assumptions made. 


24. (VII. p. 72.) Irreversibility in fluids 

A rigorous proof of the irreversibility in dense matter from the 
classical standpoint seems to be very difficult, or at least ex¬ 
tremely tedious. Green has, however, suggested a derivation 
which, though not quite rigorous, is plausible enough and 
certainly based on reasonable approximations. 

It has to be shown that the entropy S defined by (7.1) never 
decreases in time, so that 


^ J J (l+logAv)^‘^xa)<Z?<«...dx<^d5W) ^ 0, 

( 1 ) 


if/^ satisfies the equation 


which expresses that one particle of unknown position and 
velocity is added to a system of N particles. 

If is the total potential energy between the N particles and 
that between the ith of these and the additional particle, 
one has 

dt ’ m 

i—1 ^ 


+ 


1 . 

m 


r f NT ^/ n+i 

J J di 


___ 




(3) 


If this is substituted in (1) the integrals of the first two terms 
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■vanish on transformation to surface integrals. In the last sum 
all terms contribute the same, as and can be assumed to 
he symmetric in regard to all particles. Hence 

dS JcN 


dt 


mN\ 


j J J (l+log /jv) X 


X (4) 

Now the reasoning follows very closely that of Appendix, 23, 
where (23.3) was transformed into the integrahle expression 
(23.8) with the help of the identity ( 23.7). 

For this purpose one introduces instead of the velocities of the 
two particles (1) and (A+l) appearing explicitly in (4) new 
variables, namely their total momentum m, two components 
of the relative angular momentum a, and the relative energy w, 

m = a = 4m(x<-^-^«— x^^)) A (?^+^>—^<^>), 

(5) 

and regards as a function of these, so that 

/w+i = X«.—, x<^+«, m, a, w). 

Then by direct differentiation it can be verified that 

^/v+i ^In+1 \ 


f (^/n+i 

^/n+i \ 

g^a^+1) 

1 

m\ 

S^(N+1) j 

0x(i) 








an equation similar to (23.7). 


If 


S/i 




g(J)(l,V+l) 


- Q^(X) taken from it and substituted in (4), the 

only term which does not vanish is found to be 

f “cA-l/"'" J 

X (5W+l)_^tt)) . -^^dx<«<i|®...dxW+Md5W+i). (6) 


m, a, w are parameters specif 3 dng the trajectories which would 
be followed by the particles numbered (1) and (Aq-1) if no other 
particles were present. Now one can apply the same reasoning 
as in Appendix, 23, partitioning the x^-^’+'^^-domain by tubes 
formed by the trajectories of (A-j—1) relative to (1), where 
m., a, w are constant, and one can perform the integration with 
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respect to first along such a tube, then over all values of 

the cross-section b. At each end of the trajectory where the 
interaction can be neglected, the function would 

factorize into provided no other particle were near to 

the particle (iV"+l). 

This is, of course, not the case; but it seems to be reasonable to 
assume that the factorization is at least approximately correct 
as the action of the rest will nearly cancel- This is the simplifica¬ 
tion made by Green. It is clear that it could be corrected by a 
more detailed consideration; but let us be content with it. 

Since the sphere around in which is effectively 

different from zero is of microscopic dimensions, the values of 
x<N-i-i) a,nd need not be distinguished, nor the instants when 
these points are reached. The initial velocities must, 

however, be determined from the actual final velocities from the 
‘conservation’ law, i.e. the definitions (5) for constant m, a, w;: 

= m(xWA^W4.x(^-f-i)^^(N+i))^ 

If the integration in (5) is performed as described, one obtains 


where instead of the centre x = KxW+x^-^+i)) is introduced. 
Here/i^+^^ meanswhich can be replaced, accord¬ 
ing to formula (6.40) of the text, by 




^ X(^+2)^ ^(^+2)) C?X<-^+2)d:^<-^+2). 


If this is introduced into (7) one has an integral over 2iV"-|-4 
variables, where the integrand contains the factor 
By repeating the procedure one can transform (7) into the 
expression 


dS _ k 
dt “"{(Ar™i)!}2 





X dhdxdx<-^\..dx^^dx^^+^K..dx^^^H%<^'^'>d%^^\..dg^^^\ 
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where F„ is the function obtained by replacing the variables 

X® and E«) in /v by x<^+-« and respectively. 

Now one can apply the same transformations as for gases. 

as explained in Appendix, 17, which lead from (1 • ^ ” 

exchanging the dashed and undashed variables, and exchanging 
the two groups (l,2....,iV) and (l+N, 2+N,..., 2N). As it is 
obvious 'blia'b til© integral is invariant for these changes, on© 
obtains 


dt {^{N —1)1}^ J J l/iv'-^jvi 

^ ^(1) j dhdxd-K^^^. ^ (8) 

which makes it clear that dS/dt is positive or zero, and that the 
latter happens only if 

The solution of this equation leads again essentially to the 
canonical distribution. I shall, however, not reproduce the 
derivation but refer the reader to the original papers: 


M. Bom and H. S. Green, Nature^ 159, pp. 251, 738 (1947). 

- Proc. Roy. Soc. A, 188, p. 10 (1946); 190, p. 466 (1947) ; 

191, p. 168 (1947); 192, p. 166 (1948). 

H. S. Green, ibid. 189, p. 103 (1947); 194, p. 244 (1948). 

The reader may compare this involved and, in spite of the 
complication, not quite rigorous derivation from classical theory 
with the simple and straightforward proof from quantum theory 
given in section IX. 

I wish to add an argument, also due to Green, which shows 
that once the increase of entropy is secured the distribution 
approaches the canonical one. The latter is given by 


A is the free energy and RJ the energy, given by 

i==l i=l 

11 ^*^ being the macroscopic velocity at the point x^^^ 
Let the actual distribution be 


( 11 ) 


/n — 


( 12 ) 
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one has 

where U is the internal energy, and the same holds, of course, for 
f%f, so that 

jjf^dxd% = 0, = 0. (13) 

Then 

==/^l<^g/A-+/iv(lc>g/S/^+l)4-— (14) 

Hence 

s — — ^ JJ j/^-iog/N+ZivC^^—^-^+i)•••! 

Here the terms linear in vanish in virtue of (13), and one 
obtains 7 * ^ r* f's 

This shows that an increase in the value of S requires a decrease 
in the average value of |/^1 and therefore an approach to the 
canonical distribution. 

25. (VIII. p. 75.) Atomic physics 

It seems impossible to supplement this and the following 
sections, which deal with atomic physics in general, by appen¬ 
dixes in the same way as before. The reader must consult 
the literature; he will find a condensed account of these things 
in m y own book A-tomic Physics (Hlackie & Son, Glasgow; 
4 th edition 1948), which is constructed in a similar way to the 
present lectures; the text uses very little mathematics, while a 
series of appendixes contain short and rigorous proofs of the 
theorems used. For instance, Einstein’s law of the equivalence 
of mass and energy is dealt with in Chapter III, § 2 , p. 52, 
and a short derivation of the formula e = mc^ given in A. Ph. 
Appendix VII, p. 288. Whenever in the following sections I 
wish to direct the reader to a section or appendix of my other 
book, an abbreviation hke {A. Ph, Oh. III. 2 , p. 52; A. VII, 
p. 288) is used. 
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26. (VIII. p. 77.) The law of equipartition 
If the Hamytonian has the form (8.5), or 


H = e+ZT', 


- = 


( 1 ) 


where B’ does not depend on one has for the average of € in a 
canonical assembly 

J ee~^^djpdq J* ^ 


J e-^^djpdq J e-^^di 




JcT' 


as aU other integrations in numerator and denominator cancel. 
Now this can be written 

00 


d 


dp 


logZ, 


Z 


-- f e-J^^di. 

— oo 


( 2 ) 


— OO 

If the integration variable ^ = VW2)f is introduced one gets 

Z = j3-tA, 

where 4 is a constant. Hence logZ = const. — |log/? and 

1 


2p 




(3) 


in agreement with (8.6). 

Jf calculus in quantum mechanics 

ai ure o matrix mechanics to deal with aperiodic motions 
contmuous spectra, was less a matter of concention I 

follows immediately ttc.- i ‘^ys. ■Aty, 557 (1926), which 

idea that p“vliorf m,«uf . ® publication. The 

general aoW on f^-on? - 

N. Wiener, JoMm Math ^ZT ’"y M. Born and 

36, 174 (1926) wh^ S'rlv r ’ /' 

^ h nere m particular operators of the form 


= lim — 
37—>"00 2 T 


1 ^ 

^ J* Q!(f} s)... 

~T 


ds 


were used. Here the kernel alt <i\ io ^ ‘ x- 

mel q[t, 5) IS a continuous matrix’, also 
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introduced by Dirac. This paper contains also the representation 
of special quantities by differential operators (with respect to 
time) which satisfy identically the commutation law between 
energy and time Et—tE = iH, 

Schr5dinger’s discovery, which was made quite independently, 
consists in using a representation where the coordinates are 
multiplication operators and the momenta differential operators, 
so that the commutation laws 

are identically satisfied. This opened the way to finding the 
relation between matrix mechanics and wave mechanics and 
to the later development of the general transformation theory of 
quantum mechanics which is brilliantly represented in Dirac’s 
famous book. 

The early development of quantum mechanics as represented 
in text-books has become rather legendary. To mention a few 
instances: the matrices and the commutation law [g,p] = 1 
which are traditionally called Heisenberg’s, are not explicitly 
contained in his first publication: W. Heisenberg, Z. f. Phys. 

33, 879 (1925); his formulae correspond only to the diagonal 
terms of the commutator. The complete formulae in matrix 
notation are in the paper by M. Born and P. Jordan, Z.f. Phys, 

34, 858 (1925). Further, the perturbation theory of quantum 
mechanics, traditionally called Schrodinger’s, is contained 
already in the next publication of Heisenberg, Jordan, and 
myself (quoted at the beginning), not only for matrices, but also 
for vectors on which these matrices operate, and not only for 
simple eigenvalues, but also degenerate systems. The only 
difference of SchrSdinger’s derivation is that he starts from a 
representation with continuous wave functions which he aban¬ 
dons at once in favour of a discontinuous one (by a Fourier 
transformation). 

28. (IX. p. 94.) General formulation of the uncertainty 

principle 

The derivation of the most general form of the uncertainty 
principle can be found in my book {A. Ph. A. XXII, p. 326). 
As it is fundamental for the reasoning in these lectures, I shall 
give it here in a little more abstract form. 
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We assnme that for a complex operator C = A-{-iB and i'fcs 
conjugate O* = A —iB the mean value of the product 0(7^ is 
real and not negative: 

(i) 

where the bar indicates any form of linear averaging, as described 
in the text. Then writing XB instead of B, where A is a real para.- 
meter, one has 

{A+iXB)iA-iXB) = A^+'WX^—niA, J5]A >0, (2;) 

where the abbreviation (9.4) 

\A,B] = ^(AB—BA) (3) 

is used- As the left-hand side of (2) is real and also the first two 
terms on the right, it follows that [A, B] is real. The minimuro. 
of the quadratic expression in A, given by (2), occurs when 

and it is equal to A^— ^ ’ ^ 0. 

4 £2 

_*2_ 

Hence > ?L[^. £] 2 . (4) 

ISTow replace A by J .—A and B by B — B. AbA,B are numbers 
and commute with A and B, the commutator [A, J5] remains 
unchanged. Putting, as in (9.2), 

8A = {{A~AYf, SB = {(5—J)2}4, 

one obtains from (4) the formula (9.3) of the text, 

8^.85 > I ( 5 ) 

and as [9",^)] == 1, especially (9.5), 

Sp.8g>|. (6) 

This derivation reveals the simple algebraic root of the uncer¬ 
tainty relation. But it is not superfluous at aU to study the 
meaning of this relation for special cases; simple examples can 
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be found in A. Ph. A. XII, p. 296, A. XXXII, p. 357, and in 
many other books, for instance, Heisenberg, The Physical 
Principles of the Quantum Theory. 

29, (IX.p.97.) Dirac’s derivation of the Poisson brackets 

in quantum mechanics 

It is fashionable to-day to represent quantum mechanics in an 
axiomatic way without explaining why just these axioms have 
been chosen, justifying them only by the success. I think that 
no real understanding of the theory can be obtained in this way. 
One must follow to some degree the historical development and 
learn how things have actually happened. Now the decisive 
fact was the conviction held by theoretical physicists that many 
features of Hamiltonian mechanics must be right, in spite of the 
fundamentally different aspect of quantum theory. This con¬ 
viction was based on the surprising successes of Bohr’s principle 
of correspondence. In fact, the solution of the problem consisted 
in preserving the formalism of Hamiltonian mechanics as a whole 
with the only modification that the physical quantities are to be 
represented by non-commuting quantities. 

If this is accepted, there is a most elegant consideration of 
Dirac which leads in the shortest way to the rule for translating 
formulae of classical mechanics into quantum mechanics. It 
starts from the fact that classical mechanics can be condensed 
into the equation 

= ( 1 ) 

which any function/(f, representing a quantity carried by 
the motion must satisfy. Here the Poisson bracket is used 

[•£ „-| = _ ^ 

^ SPr Sp, Sq, 

If (1) is to be generalized for non-commuting quantities, it is 
necessary to consider how the Poisson bracket should be trans¬ 
lated into the new language. 

Dirac uses the fact that these brackets have a series of formal 
properties, namely 

= —h.^]. = 0, 



( 3 ) 
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where c is a constant; further 


v} — [^i» '^]+[^25 v]y 

and finaUy 

^xVi\ = [fi. ViiV2+Vi[Si> ’Ja]- 


(4) 

(5) 


Here the factors are written in a definite order, though in 
classical mechanics this does not matter. We have to do so if we 
want to use these expressions for non-commuting quantities, 
and the rule followed is simply to leave the order of factors un¬ 
changed. 

The question is, What do the brackets mean in this case ? To see 
this, we form the bracket '^ 2 ] ^ ways, using the two 

formulae (5) first in one order, then in the opposite one. Then 

[^1 ^25 Vl V 2 ] = [^l> h ^2]^2+^l[^2> '71 V 2 ] 

= {[ily ‘ 7 i ]^2 + ‘ 7 i [^ 1 » ' 72 ]}^ 2 +^ 1 ® 2 » '>? i ]’ 72 +^ i [^ 2 » ^ 2 ]} 

= [^1> '72]l2+^l[^2» ^i]'72+^1 Vz] 

and in the same way 

^23 ' 72 I ~ [^1^23'7l]‘72"i“'7l[^1^2>' 72 ] 


— [^13 Vl]i 2 '72+ll[l23 '7 i]^2+^i[^13 ' 721 ^ 2 -h '71 ii[i 2 y ' 72 ]* 
Equating these two expressions one obtains 

Ki3 '7i](^ 2 72— 72 ^2) == (1^17i—7i ^1)^23 72]* (^) 

[^13 7i] must be independent of 72 ^^<3. vice versa, it follows 

^2 72 72^2 ~ ^E^23 72l> 

where A is independent of all four quantities and commutes with 
^ 1 7i 7i ^1 ^2 72—72 ^ 2 * Hence A is a number. That it must 

be purely imaginary, A = i9i, cannot be derived in such a formal 
way; but it follows from considerations like those used in the 
previous appendix, where it is shown that a reasonable definition 
of averages implies (28.3) that 


EI 37 J] = 


( 8 ) 


is real. Thus it is established that the Poisson brackets in 
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quantum theory correspond to properly normalized commu¬ 
tators. 

If one inserts in the classical expression (2) for ^ and rj a 
coordinate or a momentum, one finds 

= 0. DPr.i’J = 0= = S„, (9) 

where S^r = ^rs = ^ for r ^ s. 

The same relations (9) must be postulated to hold in quantum 
mechanics. In this way the fundamental commutation laws are 
obtained. 

30. (IX. p. 100 .) Perturbation theory for the density matrix 

We consider the problem of solving the equation 

= H = H^+V, (1) 

where the perturbation function F is small. 

The method is essentially the same as that used for the 
corresponding problem in matrix or wave mechanics. 

Assume that A represents the eigenvalues of a complete set 
of integrals A of Hq, so that [jETq, A] == 0 and Hq becomes 
diagonal in the A-representation; put 

A) = E, Jfo(A', A') = E\ while A') = 0 for A A'. (2) 

Introduce instead of p and F the functions o- and U given by 

p(A, A') = <7(A, 

F(A,A') == ?7(A,A')c(^/^^x^-^'>^ ^ ^ 

Then one has 

? {E—E')cr{X, X 

^ 3f/ 1 3^ j 

iHop—pJSaK^,X) = 

Hence the equation (1) reduces to 

SSar(A^) ^ (U<j—aU){\,X') 

I dt 

= 2{D'(A,A")cr(A",A')-a(A,A")i7(A",X')}- W 

A" 

Now assume that o- is expanded in a series 

cr = crQ-i-0'i~f'Cr2H“**-j 
O 


5131 


( 5 ) 



appendix 

where c„is <’i.‘^2>- 

of order 1,2,... in the perturbation. Then one oDtains 


hence 

where 


dcr^jX, A ) _ i ?7(A, A'){oro(A')—o‘o(^)}’ 

8t ^ 

cr,(A,A') = 


( 6 ) 


u(X, X') = i f V(\, X') dt^^j F(X, it. (7) 

It follows for the diagonal elements from (6) that 

oi(X,X) = 0. (®> 

The next approximation oj has to satisfy the equation 

^') = 1 y {17(X,X'')oi(X".X')-oi(X,X'')i7(X",X')} 

bt 

^ -m(X, X")i7(X", X'){ar„(X)-ao(X")}] 

= 2 ti(X'',X'){o„(X")-Oo(A')}- 

_„(X, {a„(X)-Oo(X”)}]. 

We need only the diagonal elements; for these one has 


= I; y 'u(X.X')u(X',X){cro(X')—Oo(X)}, 

which gives by integration 

o-2(X.X) = 2 l«(X,X')P{o„(X')-Oo(X)}, (9) 

A' 

since, according to (7), w(A, A') is hermitian, u(A, A') — u*{X', A), 
and vanishes for t = 0, 

It is seen from (3) that the diagonal elements of p and cr are 
identical; they represent the probability P{t,X) of finding tlie 
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system at time t in the state A. Now (6), (8), and (9) give, in agree¬ 
ment with (9.25) of the text, 

iP(i, A) = J^(A)-1- ^ «7(A, A'){P(A')—P(A)}+..., (19) 


where 


[2 

p J F(A, A')e-«*Xif-s')(^i . (n) 


When F(A, A') is independent of the time one can perform the 
integration, with the result 

1 11 _ fi-imxE-jEw 


/(A,A') = p|F(A,A')P 


Now the function 


1 

l—eiyt 

27T 

iy 


behaves for large t 


like a Dirac S-function, i.e. one has 


1 /• 11 _ pi’yt\2 

Ay 

if the interval of integration Ay includes y = 0 and if iAy ^ 1. 

Suppose that the energy values are distributed so closely that 
they are forming a practically continuous spectrum. Then one 
can spHt the index A into (A, E) and replace the simple summation 
in (10) by a summation over A and an integration over E\ the 
latter can be performed on the coefficients J{X,E\ X',E') with 
the result that the formula (10) is unchanged, if the coefficients 
are given by 

J-(A, A') = ^ |F(A,A')|^S(S-J?'), (13) 


which combines the equations (9.27) and (9.28) of the text. 

As mentioned in the text. Green has found a formula which 
allows one to calculate the higher approximations in a very 
simple way. This formula is so elegant and useful that I shall 
give it here, though without proof (which can be found in the 
Appendix I, p. 178, of the paper by M. Born and H. S. Green, 
Proc. Boy. Soc. A, 192, 166, 1948). Starting from the equation 
(7) or . 

'll = 117, (14) 



196 


APPENDIX 


where TJ is known for a given perturbation V by (3), one forms the 
successive commutators 

and from them the expansion 

- -^+~3r 


... ^^22 ? * * * ? 

(16) 

1 3w-24 I 
"T 1 •••* 

(16) 


If the initial condition = 0 for ^ == 0 is added, can be deter¬ 

mined by integrating this series term by term. 

Then one forms 


^34 — “^ 2^2 '^ 2 *^ 2 > *^36 — '^34 '^ 2 "^ 34 » 

and the expansion 

_ 1^^34 I 2^36 , ^^38 , 

“®“^+Tr+“ir+-’ 


(17) 

(18) 


from which one can determine so that u^ — 0 for i = 0 . 

The second suffix I in Uj^ has been chosen to indicate the power 
of V which is involved in the expression; one has — 0(F^*‘’^) 
and this decreases rapidly with h when F or ^ are small. This rule 
makes it possible to construct - 1 ^ 4 , in a similar manner. 

Then one has the solution of (4) 

from which p is obtained by (3). 

The explicit expressions for the expansion (5) of a are 

cTi = upq — PqU , 

0-2 = + Po — Po'^22i)» 

c^3 = Po'“®) + 

~h^{'W '('^22 Po Po '^ 22 ) ('^22 Po Po'^22)^}~l~i^('^23 Po Po '*^ 23 )’*-* * 

( 20 ) 

These formulae will be useful for many purposes in quantum 
theory. Concerning thermod 37 namics, the third-order terms will 
have a direct application to the theory of fluctuations and 
Brownian motion. The customary theory derives these pheno¬ 
mena from considerations about the probability of distributions 
in an assembly which differs from the most probable one. The 
theory described here deals with one single system with the 
methods of quantum mechanics (which allows anyhow only 
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statistical predictions); deviations from the average will then 
depend on higher approximations. It can he hoped that this idea 
leads to a new approach to the theory of fluctuations in quantum 
mechanics. 

31. (IX. p. 112.) The functional equation of quantum 

statistics 

The equation (9.37), 

P(.£7(i)-l-JS7(2)) === P^(A<i>)P2 (A<2)), 

where A<^> depends on but not on and A^^^ vice versa, is 
obviously of the form 

treated in Appendix, 13, and has as solution general exponen¬ 
tial functions; hence the distribution for all three systems is 
canonical. 

32. (IX. p. 113.) Degeneration of gases 

The theory of gas degeneration is treated in my book Atomic 
Physics, but in a way which appears not to conform with the 
general principles of quantum statistics as explained in these 
lectures. According to these one always has in statistical equili¬ 
brium canonical distribution, P = while the presenta¬ 

tion in A . Ph. gives the impression that, by means of a modified 
method of statistical enumeration, a different result is obtained. 
This impression is only due to the terms used, which were those 
of the earlier authors (Bose, Einstein, Eei'mi, Sommerfeld), while 
in fact there is perfect agreement between the general theory 
and the application to gases. A simple and clear exposition of 
this subject is found in the little book by E. Schrbdinger, 
Statistical Thermodynamics (Cambridge University Press, 1946). 
I shall give here a short outline of the theory. 

In classical theory an ideal gas is regarded as a system of inde¬ 
pendent particles. In quantum theory this is not permitted, 
because the particles are indistinguishable. If i/ri(x^^^) and 
i/f 2 (x< 2 )), or shortly ^i(l) and ^ 2 ( 2 )^ wave functions of two 

identical particles with the energies and Pg, the SchrOdinger 
equation for the system of both particles, without interaction, 
has obviously the solution i/rj^(l)^ 2 (^) with the energy P^-j-Pa; 
but as the particles are identical there is another solution 
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belonging to the same energy, namely Hence any 

linear combination of tbese is also a solution. Two of these, 
namely the symmetric one and the anti-symmetric one, 

2) = ^i(1M2(2)H-^i(2)^2(1), 

«/.,(!, 2) = »Ai(1¥2(2)-^i(2M2(1) ^ 

have a special property: the squares of their moduli, \iIjs\^ Q'lid 
\>Pa\^> unchanged if the particles are interchanged. One can 
further show that they do not ‘combine’, i.e. the mixed inter¬ 
action integrals (matrix elements) vanish, 

J ^^(1, 2)^^„(l, 2) = 0, (2) 

for any operator A symmetric in the particles. Hence they repre¬ 
sent two entirely independent states of the system; each state 
being characterized by two energy-levels of the single particle 
occupied, without saying by which particle. 

The same holds for any number of particles. If 
are the energies of the states of the isolated particles, the total 
system (without interaction) has not only the eigenfunction 
«Ai(l)’iA 2 ( 2 ).-^n(^) belonging to the energy but 

all functions P^i(1 )^2(2)...^^(?2'), where P means any permutation 
of the particles, hence also all linear combinations of these. 
There are in particular two combinations, the symmetric one 
and the antisymmetric one, 


2,..., n ) = ^ ■^^^i(1)v!^2(2)...^^(?i), n 

^aCls 2,..., w,) = 2 ±^^l(l)^2(2)-..^^('?t) > (3) 

(+ for even, — for odd permutations), ; 

which have the same simple properties as described in the case 
of two particles: remains unaltered when two particles are 

exchanged wMe ^ (which can be written as a determinant) 

■F'nrtW and \4,J^ remain unchanged. 

formuS'(^^ ® fact expressed by 

svSm^tnT/* and jft describe the state of the ii-particle 
'ridualitw ae LtvTlh ^ ® particles have lost their indi- 

having a definiteLerg^Iveh of particles 

Experiment has shown that this description is adequate for all 
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particles in nature; every type of particle belongs either to one 
or the other of these two classes. 

The eigenfunctions of electrons belong, in view of spectroscopic 
and other evidence, to the antisymmetric type; hence they vanish 
if two of the single eigenfunctions are identical, i.e. if two 

particles are in the same quantum state. This is the mathe¬ 
matical formulation of Pauli’s exclusion principle. ISTucleons 
(neutrons or protons) and neutrinos are of the same type; one 
speaks of a Fermi-Dirac (F.D.) gas. Photons and mesons, 
however, and many nuclei (containing an even number of 
nucleons) are of the other type, having symmetric eigen¬ 
functions; they form a Bose-Einstein (B.E.) gas. 

In both cases the total energy may be written 

E = TV-^ —|— 71/2, €'2 'i'" • • • ■ ^.q (“I) 


where are the possible energy-levels of the single particles 

and '?^l, integers which indicate how often this level appears 
in the original sum (where each Ej^ was attri¬ 

buted to one definite particle). 

The sum of these occupation numbers 

••• = 2 '^s “ ^ 

may be given or it may not. The latter holds if particles are 
absorbed or emitted, as in the case of photons. For a B.E. gas, 
including the case of photons, there is no restriction of the 
while for a F.D. gas each energy-value can only appear once, 
if it appears at all. Hence one has the two cases 


(B.E.) 0,1,2, 3,... 

(F.D.) = 1. 


Now we apply the general laws of statistical equilibrium, which 
have to be supplemented by the fundamental rule of quantum 
mechanics that each non-degenerate (simple) quantum state has 
the same weight. (This is implied by the equation (9.14) of the 
text which shows that the diagonal element of the density matrix 
determines the number of particles in the corresponding state.) 
As we have seen in Appendix, 14, it suffices to calculate the 
partition function Z (14.24), p. 158, with all a>g = 1, 




I 




( 7 ) 
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where the sum is to be extended over all quantum numbers 'n,g, 
which describe a definite state of the system. These are just the 
numbers introduced in (4), with or without the restriction (5) 
according to the type of particle. Introducing the abbreviation 




Zg == 


(8) 

one has 





Z = 

2 

= 2 2 ^2* — 

= n 2 

(9) 



ni 9^a 

S Ths 



The sum is easily evaluated for the two cases (6), 


(B-E.) 2^5* ~ ^ y ? 

S X 

(F.D.) 2 = 1+V 

s 

One can conveniently combine the results into one expression 

z = (lo) 


where the upper sign refers throughout to the B.E, case. 

This formula contains the theory of radiation, where the 
condition (5) does not apply. But it is more convenient to deal 
with the instance where (5) holds and to relax this condition in 
the final result. 

A glance at the original form (9) of shows that the condition 
(5) indicates the selection from (10) of those terms which are 
homogeneous of order n in all the z^. 

This can be done by the method of complex integration. We 
form the generating function 

/(o = n(iT^^sr^ (11) 


and expand it in powers of The coefi&cient of is obviously 
equal to the product (9) with the restriction (5). Hence we obtain 


instead of (10) 




( 12 ) 


where the path of integration surrounds the origin in such a way 
that no other singularity is included except C = 0- 

Bor large n this integral can be evaluated by the method of 
steepest descent. It is easy to see that the integrand has one and 
only one minimum on the real positive axis. 
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As in previous cases (see Appendix, 14, 15) the crudest 
approximation suffices. One writes the integrand in the form 

wWe _ -(«H-l)log J+log/(£), 

and determines the minimum of the function p( from 


g'a) = 

then one has to calculate and 

g-^l) 


^ii^+Alog/CO = 0; 


(13) 


n-^1 . 


^2 • ^^2 


log/( 0 , 


for the value of ^ which is the root of (13). To a first approxi¬ 
mation one finds 




-n—l 




’^{%ng"a)y 

IoqZ = —(ra+l)log5+log/(0—ilog{27Tgr"(J)}. 

Neglecting 1 compared with n and the last term (which can 
he seen to be of a smaller order), one obtains 

logZ = —nlog C+log/(0; (If) 


here ^ is the root of (13), where also n-\-l can be replaced by n. 
Now one gets from (11) 


log/(0 = 2 Tlog(lT^2;5). 


dC 4 '^TCZs' 

Hence, from (8) and (13), 



1 

ea+/3e«zp 1 


= n 


j a = —log 
1 ^ = 1/JcT. 


(15) 


(16) 


From this equation a (or Q can be determined as function of 
the particle number and of temperature. One easily sees now 
that the case where the number of particles n is not given is 
obtained by just omitting the equation (16) and putting a = 0 
or ^ = 1. Yet the equation (16) is not entirely meaningless now, 
it gives the changing number of particles actually present. 
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The mean number of particles of the kind 5 is obviously 

2 j leiog^. 


n. 


ni>nz... 


-fiE 


S e- 

hence, from (14) and (15), 


Zd^e^ 




= 




( 17 ) 


which confirms (16) with (5). This formula, for the B.E. case 
(minus sign), has been mentioned in VIII (8.20), where it was 
obtained by a completely different consideration of Einstein’s. 
In the same way, the average energy of the system is found to 

TV— dlogZ €g 

ea+^egZjZ 1 


- 2 ; 

B 




B, 


(18) 


in agreement with (4). 

These are the fundamental formulae of quantum gases, derived 
from the general kinetic theory. They are to be found in A.. BTi. 
Ch. VII, p. 197; in particular the fundamental formula (17) in 
§ 5, p. 224, for B.E., and in § 6, p. 228, for E.D. All further 
developments may be read there (or in any other of the 
many books dealing with the subject). I wish to conclude this 
presentation by giving the explicit formulae for monatomic 
gases, where the energy is e = p^l2m and the summation over 
cells is to be replaced by an integration over the momentum 
space. The weight of a cell is found, by a simple quantum- 
mechanical consideration, for a single particle without spin 
{A. Ph. Ch. VII, § 4, p. 215) to be 


cu = ^dxdydzdp^dpydp^. 

Hence, introducing the integration variable 

X = p f-^— = ^ 

^{2mkTy 

one obtains from (17) and (18) 


477-F 


OO 

{2mhTyi^ f - 

0 


(19) 

( 20 ) 
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which, are the quantum generalizations of the formulae given in 
Appendix 14 and reduce to them for a -> cxd. A detailed discus¬ 
sion would he outside the plan of this book. It need only be 
said that the F.D. statistics of electrons have been fully confirmed 
by the study of the properties of metals {A. PTi. Ch. VII. 7, 
p. 229; 8, p. 232; 9, p. 235; 10, p. 236; A. XXX, p. 352). 


33. (IX. p. 116.) Quantum equations of motion 

At the end of Chapter VI, which deals with the kinetic theory 
of (dense) matter from the classical standpoint, the statistical 
derivation of the phenomenological hydro-thermal equations 
was mentioned and reference made to this later Appendix, which 
belongs to quantum theory. This was done to save space; for 
the classical derivation is essentially the same as that based on 
quantum theory, and one easily obtains it from the latter by a 
few simple rules. 

The first of these rules is, of course, the correspondence of the 
normalized commutator [a, ;8] = ^(q:^—^ a) with the Poisson 


bracket 




/ dcx. 

t=i ' 


dcx. 




doc dp \ 
sp(i) 


( 1 ) 


if x^^^ and are the position and momentum vectors on which 
a and p depend. 

The second rule concerns the operator x? which in the text is 
described in words; expressed in mathematical symbols it is 


Xa*.. = JJ c^x<a>dx(«)'S(x(«)-xto>').... (2) 

It has to be interpreted classically to mean 

Xg... == JJ (3) 

Thus the classical operation J corresponds to 

J dx(3>'S(X<«>-xto>'), 

i.e. to substituting for x^®^' as stated in the text. 

In using the correspondence principle to proceed from classical 
to quantum mechanics a product otP may not be left unchanged 
unless a and p commute; in general one must replace ocP by 
{pcP} = ^{oiPPoi)» 

By applying these rules one can easily go over from quantum 



204 


APPENDIX 


to classical formulae (and in many cases also vice versa). There¬ 
fore we give here only the quantum treatment. 

To derive the equation (9.48) from (9.46) with the help of 
(9.47), one proceeds by steps of which only the first need be 
given, as the following ones are precisely similar. One has: 


Hence, applying the operation xn (^-4:6), one obtains, using 
)6^ Pn — Pn~i> 

(5) 

The middle term on the right-hand side is 

(P'^Pjv+/5jvP‘^)- (P'^W+PivP'^) -P'^} 

and vanishes on transformation to a surface integral, because 
there is no flow across the boundary at a large distance. Hence 
(5) reduces to the equation (9.48) with q — N —1, which com¬ 
pletes the first step. The following steps are of the same pattern. 

In order to make the transition from the ‘microscopic’ equa¬ 
tions of motion (9.48) of the molecular clusters to the macro¬ 
scopic equations of hydrodynamics, one needs first to define the 
density and macroscopic velocity in terms of the molecular 
quantities. The generalized ‘density’ which reduces to the 
ordinary number density for q — 1, is obtained as a function 
of the positions by writing x^^^' = {i — 1, 2,...,g) 

in the density matrix pg(x, x'). The macroscopic velocity 
for a molecule (i) in the cluster of q molecules whose positions are 
given is the average value of the quantity represented by the 

operator — 

m , 

where the bracket {...} indicates the symmetrized product, as 
introduced above, and the subscript x' = x the diagonal 
elements of the matrix. 
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By expressing (9.48) in the coordinate representation and 
writing one obtains the equation of continuity 


dt 


i—x 


(7) 


smce 




X 


— (P®Ps+PaP®)-P^lx'- 

and 

[OW), 

= [{cl)(lx(^)—x<?)|)-<l)(|x(^)^--x(5)'|)}p(x, X')]x'=x= 0 * 

Next, multiply (9.48) by the operator before and after, taking 
half the sum, and then write {j — 1, 2,...,q). 

The left-hand side evidently reduces to m —(^^g,u^^^). One has 

further 


^ [p<’>^{PaP«'}]^-^= -:^^-{P'«(P<%)k'- 


2m 


m 


X 


and {p<« [<&«» = - 

{P<«, Xa-Ki [<!>»»+«. Pa+i]}x-x= - 
Hence, if a tensor is defined by 


n 


d0iiQ+l) 


/ 


n 




d 


^ ax<^)~'"J 


_ \(.H) — ^ 

ax<^-) ‘ «^ ax<^) 

i==i j = i 


+ I 




a+i 


ax<'^> 


dis<^+^\ 


one has 


»»|(»"«0+ 5 ^ • [^{P<’'>(P®Pa)}x'-x+ = 0. (8) 

By using (7) one obtains 


a 

m 


i = l 
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or, i£ d I dt is the convective derivative — + T 
|K<)) = 

j=l 

Hence (8) may be written in the form 


(9) 


where 


here 


mrig 


dt 



d 

0X<?> 



p(ji) _ |(?0^ 

k«'« = -{p®(p<%)}x'=.x-»»«s"«’“? 

Vfh 

v(« = i p®-u«> fr 8(x<«-x<«') 
m ® 1 


( 10 ) 


( 11 ) 

( 12 ) 


is the relative molecular velocity referred to the visible motion. 

The equation (10) is the generalized equation of motion of the 
cluster of q molecules, which reduces to the ordinary equation of 
hydrodynamics when q = i. 

the generalized pressure tensor, is seen to consist of two 
parts and associated with the kinetic energy of motion 
and the potential energy between the molecules respectively. 

The diagonal element of the tensor 16^'^ is a multiple of the 
kinetic temperature defined by 

= im{v(^)(v<%)}x'=x- (1^) 

The equation of energy transfer can be obtained in the same way 
as the equation of motion by calculating the rate of change with 
time of 


34. (IX. p. 118.) Supraconductivity 

There exists a satisfactory phenomenological theory of supra¬ 
conductivity, mainly due to F. London; it is excellently pre¬ 
sented in a book by M. von Laue, where the literature can be 
found. {Theorie der Supmleitungi Springer, Berlin u. Gottingen, 
1947). 

Many attempts to formulate an electronic theory have 
been made, without much success. Recently W. Heisenberg 
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has published some papers (Z. f. Naturforschung, 2 a, p. 185, 
1947) which claim to explain the essential features of the pheno¬ 
menon. According to this theory every metal ought to be supra- 
conductive for sufi&oiently low temperatures. Actually the alkali 
metals which have one ‘free’ electron are not supraconductive 
even at the lowest temperatures at present obtainable, and it is 
not very likely that a further decrease of temperature will change 
this. There are also theoretical objections against Heisenberg’s 
method. 

A different theory has been developed by my collaborator 
Mr. Kai Chia Cheng and myself, which connects supracon- 
ductivity with certain properties of the crystal lattice and pre¬ 
dicts correlations between structure and supraconductive state, 
which are confirmed by the facts (e.g. the behaviour of the alkali 
metals). The complete theory will be worked out in due course. 

35. (X. p. 124.) Economy of thinking 

The ideal of simplicity has found a materialistic expression in 
Ernst Mach’s principle of economy in thought {Prinzip der 
Denic-Ohonomie). He maintains that the purpose of theory in 
science is to economize our mental efforts. This formulation, 
often repeated by other authors, seems to me very objectionable. 
If we want to economize thinking the best way would be to stop 
thinking at all. A minimum principle like this has, as is well 
known to mathematicians, a meaning only if a constraining 
condition is added. We must first agree that we are confronted 
with the task not only of bringing some order into a vast expanse 
of accumulated experience but also of perpetually extending this 
experience by research; then we shall readily consent that we 
would be lost without the utmost efficiency and clarity in think¬ 
ing. To replace these words by the expression ‘economy of 
thinking’ may have an appeal to engineers and others interested 
in practical applications, but hardly to those who enjoy thinking 
for no other purpose than to clarify a problem. 

36. (X. p. 127.) Concluding remarks 

I feel that any critical reference to philosophical hterature 
ought to be based on quotations. Yet, as I have said before, my 
reading of philosophical books is sporadic and unsystematic, 
and what I say here is a mere general impression. A book which 
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I have recently read with some care is El. Cassirer’s Determinis- 
mus und IndeterTninismus in der modernen Physih (Gdteborg, 
Elanders, 1937), which gives an excellent account of the situa¬ 
tion, not only in physics itself but also with regard to possible 
applications of the new physical ideas to other fields. There one 
finds references to and quotations from all great thinkers who have 
written about the problem. The last section contains Cassirer’s 
opinion on the ethical consequences of physical indeterminism 
which is essentially the same as that expressed by myself. I 
quote his words (translated from p. 259): ‘Erom the significance 
of freedom, as a mere possibility hmited by natural laws, 
there is no way to that '‘reality” of volition and freedom of 
decision with which ethics is concerned. To mistake the choice 


{Auswahl) which an electron, according to Bohr’s theory has 
between different quantum orbits, with a choice {Wahl) in 
the ethical sense of this concept, would mean to become the 
victim of a purely linguistic equivocality. To speak of an ethical 
choice there must not only be different possibilities but a con¬ 
scious distinction between them and a conscious decision 
about tnem. To attribute such acts to an electron would be a 


gross relapse into a form of anthropomorphism. .. .’ Concerning 
the inverse problem whether the freedom’ of the electron helps 
us to understand the freedom of volition he says this (p. 261): 
‘It IS of no avail whether causality in nature is regarded in 
the form of rigorous “dynamical” laws or of merely statistical 
laws. ... In neither way does there remain open an access to 
that sphere of “freedom” which is claimed by ethics’. 

My short survey of these difficult problems cannot be com¬ 
pared with Cassirer’s deep and thorough study. Yet it is a 
satisfaction to me that he also sees the philosophical importance 
o quan um t ©oiy not so much in the question of indeterminism 
but in the possibility of several complementary perspectives or 
aspec s m the description of the same phenomena as soon as 
^erent standpoints of meaning are taken. There is no unique 
image of our whole world of experience. 


me the deKvenng the lectures, gives 

^ ^ 7* express my thanks to those among my 
audience who came to me to discuss problems and to raise 
jee ions. One of these was directed against my expression 



e 


APPEISTDIX 


209 


‘observational invariants’; it was said that the conception of 
invariant presupposes the existence of a gronp of transformations 
which is lacking in this case. I do not think that this is right. 
The problem is, of course, a psychological one; what I call 
‘observational invariants’ corresponds roughly to the Oestalten 
of the psychologists. The essence of Gesfalten theory is that the 
primary perceptions consist not in uncoordinated sense im¬ 
pressions but in total shapes or configurations which preserve 
their identity independently of their own movements and the 
changing standpoint of the observer. Now compare this with 
a mathematical example, say the definition of the group of 
rotations as those linear transformations of the coordinates 
x^y,z for which is invariant. The latter condition 

can be interpreted geometrically as postulating the invariance 
of the shape of spheres. Hence the group is defined by assuming 
the existence of a definite invariant configuration or Oestalt^ not 
the other way round. The situation in psychology seems to me 
quite analogous, though much less precise. Yet I think that this 
analogy is of some help in understanding what we mean by real 
things in the flow of perceptions. 

Another objection was raised against my use of the expression 
‘metaphysical’ because of its association with speculative sys¬ 
tems of philosophy. I need hardly say that I do not like this kind 
of metaphysics, which pretends that there is a definite goal to be 
reached and often claims to have reached it. I am convinced 
that we are on a never-ending way; on a good and enjoyable way, 
but far from any goal. Metaphysical systematization means 
formahzation and petrification. Yet there are metaphysical 
problems, which cannot be disposed of by declaring them 
meaningless, or by calling them with other names, hke epistemo¬ 
logy. For, as I have repeatedly said, they are ‘beyond physics’ 
indeed and demand an act of faith. We have to accept this fact 
to be honest. There are two objectionable types of believers: 
those who believe the incredible and those who believe that 
‘belief’ must be discarded and replaced by ‘the scientific method’. 
^Between these two extremes on the right and the left there is 
enough scope for believing the reasonable and reasoning on 
sound beliefs. Faith, imagination, and intuition are decisive 
factors in the progress of science as in any other human activity. 
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